AUSTRALIA
Patents Act 1990 

CSL LIMITED 
PROVISIONAL SPECIFICATION 

Invention Title: 

P. gingivalis nucleotides and polypeptides 
The invention is described in the following statement: 
P. gingivalis nucleotides and polypeptides

FIELD OF THE INVENTION 

The present invention relates to P. gingivalis nucleotide sequences,
P. gingivalis polypeptides and probes for detection of P. gingivalis.

BACKGROUND OF THE INVENTION 

Periodontal diseases are bacterial-associated inflammatory diseases of
the supporting tissues of the teeth and range from the relatively mild form of
gingivitis, the non-specific, reversible inflammation of gingival tissue, to the
more aggressive forms of periodontitis which are characterised by the
destruction of the tooth's supporting structures. Periodontitis is associated
with a subgingival infection of a consortium of specific Gram-negative
bacteria that leads to the destruction of the periodontium and is a major
public health problem. One bacterium that has attracted considerable
interest is P. gingivalis, as the recovery of this microorganism from adult
periodontitis lesions can be up to 50% of the subgingival anaerobically
cultivable flora, whereas P. gingivalis is rarely recovered, and then in low
numbers, from healthy sites. A proportional increase in the level of
P. gingivalis in subgingival plaque has been associated with an increased
severity of periodontitis, and eradication of the microorganism from the
cultivable subgingival microbial population is accompanied by resolution of
the disease. The progression of periodontitis lesions in non-human primates
has been demonstrated with the subgingival implantation of P. gingivalis.
These findings in both animals and humans suggest a major role for
P. gingivalis in the development of adult periodontitis.
P. gingivalis is a black-pigmented, anaerobic, asaccharolytic,
proteolytic Gram-negative rod that obtains energy from the metabolism of
specific amino acids. The microorganism has an absolute growth
requirement for iron, preferentially in the form of haeme or its Fe(III)
oxidation product haemin, and when grown under conditions of excess
haemin is highly virulent in experimental animals. A number of virulence
factors have been implicated in the pathogenicity of P. gingivalis including
the capsule, adhesins, cytotoxins and extracellular hydrolytic enzymes.

In order to develop an efficacious and safe vaccine to prevent
P. gingivalis colonisation it is necessary to identify and produce antigens that
are involved in virulence and that have utility as immunogens to generate
neutralising antibodies. Whilst it is possible to attempt to isolate antigens
directly from cultures of P. gingivalis, this is often difficult. For example, as
mentioned above, P. gingivalis is a strict anaerobe and can be difficult to
isolate and grow. It is also known that, for a number of organisms, many
virulence genes are down-regulated when the organism is cultured in vitro
and the encoded proteins are no longer expressed. If conventional chemistry
techniques were applied to purify vaccine candidates, potentially important
(protective) molecules may not be identified. With DNA sequencing, as the
gene is present (but not transcribed) even when the organism is grown in
vitro, it can be identified, cloned and produced as a recombinant DNA
protein. Similarly, a protective antigen or therapeutic target may be
transiently expressed by the organism in vitro or produced in low levels,
making the identification of these molecules extremely difficult by
conventional methods.

With serological identification of therapeutic targets one is limited to
those responses which are detectable using standard methods such as
Western Blotting or ELISA. The limitation here is both the level of
response that is generated by the animal or human and determining whether
this response is protective, damaging or irrelevant. No such limitation is
present with a sequencing approach to the identification of potential
therapeutic or prophylactic targets.

It is also well known that P. gingivalis produces a range of broadly
active proteases (University of Melbourne International Patent Application
No PCT/AU96/00673, US Patent Nos 5,475,097 and 5,523,390), which make
the identification of intact proteins difficult because of their degradation by
these proteases.

SUMMARY OF THE INVENTION 

The present inventors have attempted to isolate P. gingivalis nucleotide
sequences which can be used for recombinant production of P. gingivalis
polypeptides and to develop nucleotide probes specific for P. gingivalis. The
DNA sequences listed below have been selected from a large number of
P. gingivalis sequences according to their indicative potential as vaccine
candidates. This intuitive step involved comparison of the deduced protein
sequence from the P. gingivalis DNA sequences to the known protein
sequence databases. Some of the characteristics used to select useful vaccine
candidates include: the expected cellular location, such as outer membrane
proteins or secreted proteins; particular functional activities of similar
proteins, such as those with an enzymatic or proteolytic activity; proteins
involved in essential metabolic pathways that when inactivated or blocked
may be deleterious or lethal to the organism; proteins that might be expected
to play a role in the pathogenesis of the organism, e.g. red cell lysis, cell
agglutination or cell receptors; and proteins which are paralogues to proteins
with proven vaccine efficacy. DNA sequences that were considered to be
poor vaccine candidates and not selected include those that code for proteins
involved in replication, non-essential proteins involved in cellular processes
and those proteins present at sites that would be unlikely to be affected by
immune mediators such as those found in the bacterial cytoplasm or inner
membranes.

In a first aspect the present invention consists in an isolated
P. gingivalis nucleotide sequence, the nucleotide sequence consisting of or
including a sequence selected from the group consisting of SEQ ID NO: 1
to 11, fragments thereof and sequences complementary thereto.

In a second aspect the present invention consists in an isolated
P. gingivalis polypeptide, the polypeptide being at least partially encoded by
a nucleotide consisting of or including a sequence selected from the group
consisting of SEQ ID NO: 1 to 11, fragments thereof and sequences
complementary thereto.

In a third aspect the present invention consists in an isolated
P. gingivalis polypeptide consisting of or including an amino acid sequence
selected from the group consisting of SEQ ID NO: 12 to 33 or fragments
thereof.

In a fourth aspect the present invention consists in an isolated
polynucleotide, the polynucleotide encoding a polypeptide consisting of or
including an amino acid sequence selected from the group consisting of
SEQ ID NO: 12 to 33 or fragments thereof.

In a fifth aspect the present invention consists in a nucleotide probe
specific for P. gingivalis, the probe including a detectable label and a
nucleotide sequence of at least 15 nucleotides, the nucleotide sequence being
derived from a sequence selected from the group consisting of SEQ ID NO: 1
to 11 and sequences complementary thereto.

In a sixth aspect the present invention consists in a composition for use
in raising an immune response in an animal directed against P. gingivalis, the
composition including an acceptable carrier and/or adjuvant and at least one
polypeptide having a sequence selected from the group consisting of SEQ ID
NO: 12 to 33 and fragments thereof.

As will be understood by those skilled in the art, the nucleotides of the
present invention may be useful in DNA vaccination to reduce the incidence
and/or severity of P. gingivalis infection.

Accordingly in a seventh aspect the present invention consists in a
composition for use in inducing an immune response, the composition
including at least one DNA molecule, the at least one DNA molecule having
or including a sequence selected from the group consisting of SEQ ID NO:
1 to 11, fragments thereof and sequences complementary thereto.

Further information regarding DNA vaccination may be found in
Donnelly et al. Journal of Immunological Methods 176 (1994) 145-152, the
disclosure of which is incorporated herein by reference.

Throughout this specification, unless the context requires otherwise,
the word "comprise", or variations such as "comprises" or "comprising", will
be understood to imply the inclusion of a stated element or integer or group
of elements or integers but not the exclusion of any other element or integer
or group of elements or integers.

DETAILED DESCRIPTION 

Preparation of the P. gingivalis library for sequencing.

To determine the DNA sequence of P. gingivalis, genomic DNA was isolated
from P. gingivalis strain W50 (ATCC 53978) essentially by the method
described by Marmur J. (1961)(1). Cloning of DNA fragments was performed
essentially as described by Fleischmann et al., (1995)(2). Briefly, purified
genomic DNA from P. gingivalis was nebulized to fragment the DNA and was
treated with Bal31 nuclease to create blunt ends, then run twice on
preparative 1% agarose gels. DNA fragments of 1.6-2.0 kb were excised from
the gel and the DNA recovered. This DNA was then ligated to the vector
pUC18 (SmaI digested and dephosphorylated; Pharmacia) and
electrophoresed on a 1% agarose preparative gel. The fragment comprising
linear vector plus one insert was excised and purified, and this process was
repeated to reduce contamination by vector without insert. The recovered
vector plus insert DNA was blunt-ended with T4 DNA polymerase, then a final
ligation to produce circular DNA was performed. Aliquots of Epicurian Coli
Electroporation-Competent Cells (Stratagene) were transformed with the
library DNA and plated out on SOB agar antibiotic diffusion plates
containing X-gal and incubated at 37°C overnight. Colonies with inserts
appeared white and those without inserts (vector alone) appeared blue.
Plates were stored at 4°C until the white clones were picked and expanded
for the extraction of plasmid DNA for sequencing.



DNA sequencing 

Plasmid DNA was prepared by picking bacterial colonies into 1.5 ml of
LB, TB or SOB broth supplemented with 50-100 µg/ml Ampicillin in 96 deep
well plates. Plasmid DNA was isolated using the QIAprep Spin or QIAprep
96 Turbo miniprep kits (QIAGEN GmbH, Germany). DNA was eluted into a
96 well gridded array and stored at -20°C.

Sequencing reactions were performed using ABI PRISM Dye
Terminator and ABI PRISM BIGDye Terminator Cycle Sequencing Ready
Reaction kits with AmpliTaq DNA polymerase FS (PE Applied Biosystems,
Foster City, CA) using the M13 Universal forward and reverse sequencing
primers. Sequence reactions were conducted on either a Perkin-Elmer
GeneAmp 9700 (PE Applied Biosystems) or Hybaid PCR Express (Hybaid,
UK) thermal cyclers. Sequencing reactions were analysed on ABI PRISM 377
DNA sequencers (PE Applied Biosystems).

The sequences obtained are set out below.

SeqID#1-11 represent the DNA sequences that encode the proteins in
SeqID#12-33. Proteins in SeqID#12-22 are the entire open reading frame
from DNA SeqID#1-11. Proteins in SeqID#23-33 are the proteins encoded by
DNA SeqID#1-11 from their putative initiation codon. The initiation codon
was calculated from sequence homology alignment using FastX or by the
ORF prediction program GeneMark.

As will be understood by those skilled in the art, open reading frames
(ORFs) may be readily identified. ORFs may be determined using two
methods. For example, alignments from FastX search results may be used to
define the start and end positions of coding regions if sufficient protein
homology is present. Alternatively, protein coding regions may be identified
using the ORF identification program GeneMark (3) using a training matrix
based on published P. gingivalis sequences. This matrix may be further
refined by adding ORFs identified from the results of homology searching
and ORFs identified by GeneMark. The program PSORT (4) may be used for
the detection of signal sequences and the prediction of protein cell
localisation. A UNIX version of TopPred (5) may be used to identify
potential membrane spanning domains.
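
By way of illustration only, the general idea of locating candidate ORFs in a DNA sequence can be sketched as a naive six-frame scan for ATG...stop stretches. This is not the FastX or GeneMark method referred to above (GeneMark uses a statistical model trained on known sequences); the function names, the min_codons threshold and the coordinate convention below are assumptions made for this sketch.

```python
from typing import List, Tuple

STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODON = "ATG"
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_orfs(seq: str, min_codons: int = 30) -> List[Tuple[str, int, int, int]]:
    """Naive ORF scan over all six reading frames.

    Returns (strand, frame, start, end) tuples. Coordinates are 0-based and
    end-exclusive on the strand that was scanned; the length threshold
    counts the start and stop codons. Hypothetical helper, not GeneMark.
    """
    results = []
    for strand, s in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == START_CODON:
                    start = i  # first ATG in this frame opens a candidate ORF
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        results.append((strand, frame, start, i + 3))
                    start = None  # look for the next ATG after this stop
    return results
```

A scan of this kind only proposes candidates; as described above, homology alignments or a trained gene-finding model would still be needed to decide which candidates are likely to be genuine coding regions.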

DNA sequence analysis 

Raw trace data files from the ABI 377 sequencer were manually
trimmed using Staden Pregap (Laboratory of Molecular Biology, Medical
Research Council, UK) running on a Sun Microsystems computer. Trimmed
files were assembled into contigs using Staden Gap v4.1 and exported as a
consensus file in FastA format. This consensus was converted into GCG
format files and analysed for homology using the FASTX algorithm on a
non-redundant protein database compiled by ANGIS (Australian Genomic
Information Service, University of Sydney). Individual FASTX search results
were examined for significant homology by statistical probability and amino
acid alignments.
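
As a small illustration of the FastA format mentioned above (a ">" header line followed by wrapped sequence lines), the following sketch reads and writes such records. It is not part of the Staden or GCG software; the function names and the 60-character line width are assumptions made for this example.

```python
from typing import Dict


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FastA-formatted text into a {record name: sequence} mapping."""
    records: Dict[str, str] = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            parts = line[1:].split()
            name = parts[0] if parts else ""  # first word of the header
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()  # join wrapped sequence lines
    return records


def wrap_fasta(name: str, seq: str, width: int = 60) -> str:
    """Format one sequence as a FastA record with fixed-width lines."""
    lines = [seq[i:i + width] for i in range(0, len(seq), width)]
    return ">" + name + "\n" + "\n".join(lines) + "\n"
```

For instance, a consensus written with wrap_fasta("contig1", consensus) can be read back with parse_fasta to recover the same sequence under the key "contig1".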

The results are set out in Table 1. 
It will be appreciated by persons skilled in the art that numerous
variations and/or modifications may be made to the invention as shown in
the specific embodiments without departing from the spirit or scope of the
invention as broadly described. The present embodiments are, therefore, to
be considered in all respects as illustrative and not restrictive.

Dated this twenty-ninth day of July 1998 

CSL LIMITED 

Patent Attorneys for the Applicant: 
F B RICE & CO 
(2) INFORMATION FOR SEQ ID NO: 1

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2547 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...2547

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1



TTCGGAATAT CACCTTCAAT GAAAAAAAGT TTTCTTTTAG CCATAGTAAT GCTCTTTGGC 60

ATTGCCATGC AGGGACATTC TGCTCCGGTT ACGAAAGAGC GAGCTTTGAG TCTGGCTCGG 120

CTGGCTTTGC GACAGGTATC CTTGCGAATG GGACAAACAG CAGTATCTGA CAAGATTTCC 180

ATCGATTACG TTTATCGGCA AGGAGATGCT GAGAGGGGTA TCACATCACA AGAGGAAGGC 240

TCTCCTGCAT ATTTTTATGT AGCTAATCGT GGAAATAATG AGGGCTATGC TCTTGTAGCA 300

GCAGATGACA GAATACCGAC AATTTTAGCC TATTCACCCA TTGGCCGTTT CGACATGGAC 360

AGTATGCCGG ACAATCTTCG CATGTGGCTA CAAATTTACG ATCAGGAAAT AGGCCTGATA 420

CTTTCCGGAA AAGCTCAGCT CAATGAAGAG ATATTACGTA CCGAGGGCGT ACCGGCTGAA 480

GTACATGCTC TGATGGATAA CGGTCATTTT GCCAACGATC CCATGCGATG GAATCAAGGT 540

TACCCATGGA ACAATAAGGA ACCACTGCTT CCTAATGGCA ATCATGCCTA TACCGGCTGT 600

GTTGCTACTG CTGCAGCACA AATCATGCGC TACCATAGCT GGCCGCTTCA AGGTGAAGGC 660

TCTTTCGATT ATCATGCAGG TTCATTAGTT GGCAACTGGT CCGGCACATT TGGTGAAATG 720

TACGACTGGA TCAATATGCC CGGAAATCCC GACCTTGATA ATCTGACTCA ATCTCAAGTG 780

GATGCCTACG CCACACTGAT GCGTGATGTG AGTGCATCTG TTTCGATGAG TTTTTATGAA 840

AATGGAAGTG GTACGTACAG CGTTTATGTA GTAGGAGCCT TGCGAAACAA CTTTCGCTAC 900
AAGCGTTCAC TGCAGCTACA TGTACGCGCC TTATATACCT CACAGGAGTG GCACGATATG 960

ATCCGCGGGG AACTTGCCTC CGGAAGGCCG GTCTATTATG CAGGGAATAA CCAGAGCATA 1020

GGACATGCTT TCGTTTGCGA TGGTTATGCT TCGGATGGTA CTTTCCATTT CAACTGGGGT 1080

TGGGGAGGTG TTTCCAACGG CTTCTACAAA CTAACACTCC TCTCGCCGAC TTCGTTGGGT 1140

ATCGGAGGTG AGGGAATAGG TTTTACCATT TATCAAGAGA TCATCACCGG TATCGAACCG 1200

GCTAAGACTC CCGCTGAAGC CGGTACAGAT GCCTTGCCGA TCTTGGCACT GAAAGACATA 1260

GAAGCCGAGT ATAAAAGTGA ATCCGGATTG AACGTAGGGT ATTCGATATA TAATACAGGT 1320

GAAGAGCAAT CAAATCTTGA CCTCGGATAC AGATTGAACA AGGCTGACGG AGAAGTCATA 1380

GAGGTGAAAA CTTCATCTAT CAATATCTCT TGGTACGGAT ACGGAGAGCA TCCCGAGAGT 1440

TTCTCATTGG CACCTAATCA GTTGTCACAA GGAATCAACA CCATCACCCT ACTTTATCGT 1500

CGCACAGGCA CCGAACAGTG GGAGCCGGTA CGGCATGCAC AGGGAGGATA TGTCAATAGC 1560

ATTAAAGTAA ATACGACAGA CCCGAACAAT GTCGTAGTCA CGGTAGATAA TAACGAAGGC 1620

AAGCTCAGTA TCGTCCCCAA CAGCTTTGTC GCAGATCTGA ATTCTTATGA ACATAGTACG 1680

ATTACAGTAC AGTTCAATAG CGACAGCCCT GATGAGATCC GTACACCCGT AGCCTTTGCT 1740

CTATCTACAG GAGCTACTGC GGACGATGTA ATATCTTTGG GCTGGGTAAT GGCTGAAGTT 1800

CCGGGCGGTA GCAGCAACTA TCCGGTGGTT TGGTCTAAAG ACGTTCTCAC TCTCTCGGAA 1860

GGCGACTATA CATTGTGGTA TAGATTTTCC ATCAACAACC AAAAGGATGA ATGGAAAAAG 1920

ATCGGAAGCG TGTCAGTAAA AACACCGACA GAGTATACGC ACCCCTTATT CGAAGTGGGC 1980

CATAATCAAA CTTCTACCTA TACGCTGGAT ATGGCACACA ACAGAGTATT GCCCGACTTT 2040

ACACTCAAAA ATCTCGGATT GCCTTTCAAT GGTGAGTTGG TTGTTGTTTT CCGCCAAACA 2100

CAATCCTCAT CGGGGTCTTT ATGGGCAGCT CAAGAAACAG TACATATCAA GCAAGGAGAA 2160

ACTTTCGTAT ATAAACCTGT TGTCGAAGGC CCTATACCTG ATGGATCCTA TCGTGCGACC 2220

CTCCATGCAT TCGTAAACGG ACAACAACAG TTGTACCTCA AGGGGAAAAG GAACTACACG 2280

GTGAAGATCG TCAATGGTAC AGCGGTAGAA GCAATAGAAT CGTCAGAAGA GATCAGAGTA 2340

TTCCCTAATC CGGCACGCGA TTATGTGGAA ATATCGGCAC CTTGCATTCC CCAAGAAACA 2400

TCTATCATTC TTTTCGATCT GTCAGGCAAG ATTGTCATGA AGAATAGTTT ATCAGCGGGG 2460

CATGGCAGAA TGGATGTCAG CCGACTTCCT AATGGGGCCT ACATCCTTAA GGTGGATGGA 2520

TATACGACGA AAATAAATAT AGTGCAC 2547

(2) INFORMATION FOR SEQ ID NO: 2

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3807 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2

GTAAAATGTA TGGGCAAGTA TAAAAGAGCT AAGTACCGCT ATTGGCTTTT TCCTTTCTGT 60

TCGGATTATT ATACCTTTGA GGGAGTTACT TTTTTATGCG CATCTGACGA TATGACAACC 120

AAGAAACCCC AAGCCATTTT AGACTTAGAG AAGGCCTATA ACATTGAAAT TCCTGATCTC 180

TCCTCACAAG AAGGGATAAG CTGGTCGGTA AATAGATATT TCAAGCAAGA TTCCTCCGGT 240

GCAGTCGTTG AGCTTTGCTT GCGAGAATGT CAGATAGAAA GCATGACTTG GCTTATTGAT 300

TTTCCTGCTC TAAAAAAGCT TGATCTATCG TATAACCAAA TCAGTAAGCT AGAGGGTCTA 360

GAACGTCTTA CTTCGTTAAC AAAACTTCGT CTAAGAAGTA ACCAAATCCG TAAACTAGAG 420

GGCCTGGATA GTCTCACCTC GCTAACAAAA CTTTCTCTCT CCGATAACCA AATCAGTAAG 480

CTAGAGGGTC TGGAACGTCT CACCTCGTTA GCGGAGCTTT ATCTTTTGGA TAACCAAATC 540

AGTAAACTAG AGGGTCTGGA ACGTCTCACG TCCTTAGCAA CGCTTGAACT ATCGGGTAAC 600

CAAATCCGTA AGCTGGAGGG TCTGGAACGT CTCACGTCCT TAGCAACGCT TGAACTATCG 660

GGTAACCAAA TCCGTAAGCT AGAGGGTCTG GAACGTCTCA CTTCGTTAAC AAAGCTTCGT 720

CTAAGAAGTA ACCAAATCAG TAAGCTAGAG GGTCTGGAAC GTCTCACGTC CTTAGCAACG 780

CTTGAACTAT CGGGTAACCA AATCCGTAAG CTGGAGGGTC TGGAACGTCT CACGTCCTTA 840

GCAACGCTTG AACTGTCGGG TAACCAAATC AGTAAGCTAG AGGGTCTGGA ACGTCTCTCT 900

TCGTTAACAA AGCTTCGTCT AAGAAGTAAC CAGATCAGTA AACTAGAGGG CCTGGAACGT 960

CTCACCTCGC TAACAAAACT TTCTCTCTCC GATAACCAAA TCAGTAAGCT AGAGGGTCTG 1020

GAACGTCTCA CCTCGTTAGC GGAGCTTTAT CTTTTGGATA ACCAAATCCG TAAGCTGGAG 1080

GGCCTGGAAC GTCTCACCTC GTTAACAAAG CTTCGTCTAA GAAGTAACCA AATCAGTAAA 1140

CTAGAGGGCC TGGATAGTCT CACCTCGCTA ACAAAACTTT CTCTCTCCGA TAACCAAATC 1200

AGTAAACTAG AGGGCCTGGA ACGTCTCACG TCCTTAGCGG AGCTTTATCT TTTGGATAAC 1260

CAAATCCGTA AGCTGGAGGG TCTTGATGGT CTTGCTTCCT TAACAAGGCT TAGTCTAAGG 1320

CGCAACCAAA TCAGTAAGCT GGAAGGACTA GACAGACTAA AGGTTTTGAG AAAACTTGAT 1380

GTTTCGGGCA ATGATATTCA ATCTATTGAT GATATTAAGC TATTGGCTCC GATTCTGGAG 1440

CAAACTTTAG AAAAACTGAG AATCCATGAC AATCCATTTG TTGCATCATC AGGCTTGATA 1500

CTCTCTCCTT ATGATAATCA TTTGCCGGAG ATTAAAGCTC TTCTTGAAAA AGAAAAAGAA 1560

AAACAGAAAA AGACTTCAGT TGAATATCAC CCATTTTGCA AAGTAATGCT ATTGGGAAAT 1620
CATTCTTCGG


GTAAAACAAC 


ATTTCTTAGT 


C AAT AC G A i A 


A A 7\ TUT" A A 

CAAAi iAlAC 


/^rpAT'/^A/^A A A 

G i A J. CAGAAA 


c a r\ 
1 DOO 




TV 7\ m TV TV rn 

AA.T AC ACA 1 G 


1 Gl TGTCGAT 


TV TV m TV TV 

AC AT C GAAG C 


AATAACCC i A 


A rp/^z-tOArni^rprn 

Ai GCGAi C 1 i 


TTACGACTTT 


17 4 0 




GGGGGAL.AGG 


1 A 1 i A i C A 


y GGGATl 1 AC 


/-•TV Tv/^/^r^rpTTprp 
CAAGCCi i i i 


TiT^Ar^r^Ar^/^r^A 
1 iACCACCCA 


AT^/^'/^n^nPArpA r* 


18 00 




CTTCTCTTTT 


GGGAT GCT AA 


TV TV TV m Ti^ 

GAAGGAT C GA 


TiL TV mmm rTi TV 

AACTTTGl GA 


/"•/"•/"•rn A A m TV 

G C GT AGAT GA 


m A A A ^ A A m A m 

1 AAAGAAT AT 


1860 


r- 

O 


CAGACTCTTA 


TV mmm tv tv m 

ATTTCAATCG 


CCCCTATTGG 


mm TV TV TV TV 

TT AG GAC AGA 


m TV /^m TV m 

TAG C CTAT GC 


^^m/^/^TV Tvrrirf^^^m 

CTGCAATCGT 


1920 




TGTATGTCCG 


TTGGAGGAAA 


TCCTGATGGC 


AAGGACACAC 


C ACAGACC AC 


AGAC GAT ACA 


1980 




ATT AT CATT C 


AGACTCATGC 


CGATGAAACG 


GGCGCTAAGC 


AGCAAACCTT 


AGGCTGTGCA 


2040 




GC C GAGAAT G 


GAGTATTGGA 


AGAAAT CTAT 


GT AT C C T TAG 


AG C C CAAG GC 


GAAT AGT G C C 


2100 




GTACATGCGC 


TCAACTATCT 


GAATGAGCGG 


GT GC GAGAAG 


TTGTCGCAAG 


CAGGAGTAAA 


2160 


ID 


T CAATT CAGA 


T C AC AG AAAA 


AGATAAGGGA 


TT GTAC GAAG 


CTCTTCCCAC 


AATCGCCGGT 


2220 




GATAATAAAC 


ACATCCCTAT 


CTCTCTCGAA 


GCTCTTGCGG 


CT CAATT GAA 


TAAGGGAAGA 


2280 




G CT GAAAAT G 


ATCTTTACAC 


CATAGAGTAT 


CT ACAGACC G 


TV TV ^\TX\ TV TV ^ TV 

AATT GAAC CA 


GCTTAGTCTG 


2340 




CGAGGGGAGG 


TGCTTTACTA 


T C GT GAGAAT 


TV TV TV /^m TV 

GAGAAGCT GA 


tk. TV TV mm TV m 

AC AATT AT GT 


y 1 y^ y^rnm tv tv m 

CT GGTTAGAT 


24 00 




CCGGCAGCTT 


TTGTCCAAAT 


GATT CAT GGA 


GAAATCCTCC 


TV TV TV TV TV TV TV TV 

AAAAAGACAA 


TV m TV TV m TV TV 

CAT CAAT AGA 


2 4 60 


lb 


GGAACAGTT C 


CTAAAGACAT 


TTTTGAATGC 


AAACT GCAT A 


TV m/^mT\ TV ^^mm/** 

ATCTAAGTTC 


^< TV TV m TV m TV 

CGGAAGTATA 


2520 




TTT GAAGAAG 


ATGGCCAAAA 


T GGTAATAT G 


ATCTTGCAGC 


m TV m m tv mnr tv 

TATT ATT GGA 


TV TV r^tT% TV m 

AGAGCT GAT C 


2580 




GTATATGAAG 


ATAAGGACTG 


CTATGTGATA 


CCGGGCTATC 


TCCCTTTGCA 


TTCCGATGAC 


264 0 




GAAGCCTATA 


AATGGCTTAC 


TTTGGGATTC 


GAGAGGCCCA 


ATTTTGTCCT 


C AAAT T C GAA 


2700 




CGTTTTATCC 


CCTTTGGCCT 


GATCAACCAG 


ATTATAGCCT 


ACTATGGCCG 


GGAAGAAGGT 


2760 


20 


GCTCTAAAGC 


GGTATTGGCG 


AGATCAGGTC 


ATCTTCACAG 


CAGGCCGTGA 


GATGGATAGG 


2820 




CAAACGCTTG 


AGCAAGAAGA 


AGAG/SkAAGAG 


GGTTTGCCCA 


AGACGAATGC 


CGAGGATTAT 


2880 




CAGATCTGGA 


TCAAGCTCGA 


CTTTACCGAC 


TTGGCCATAT 


CCGTATTCAT 


CAAAGAGCAG 


2940 




AGAAAGACAT 


CAGCTAAGGA 


TATGCAGCGG 


AAAGAGGCTA 


CTATCCTCAG 


TGATATGTTG 


3000 




GAT AT GT ATT 


GGAACAATAT 


CCCTCCGAGG 


GAGCAAATAG 


GAGATAAGGA 


T AC GGAGCAA 


3060 


25 


ACGAGAAGCA 


CTATTCGTGA 


AACAAACAGA 


AAGAAGAGAC 


CCATCCAGGA 


TCTCTACCTC 


3120 




TCCTGTGCCC 


AAGCGGATAA 


AGATTTGACG 


GAGTCTCATT 


ATATCCATTT 


GGGCACGCTG 


3180 




GACGAT GAAA 


GCAAGACTAC 


GGCGAGGATT 


GCAGCCTATC 


/ ^mm TV TV TV TV 

CGTTGAAGAA 


CGGCGTTATC 


3240 




GATAAAGAGC 


GGGTGCGAGA 


AGTATCGACT 


CGT C CCTACA 


TV TV TV m g ' irnm m 

AACATCTTTC 


*■ 11 ^* TV TV mTV TV TV 

CGT CAATAAA 


3300 




AATCTGGCTA 


CT GCAAAACA 


GAT CTTT ATT 


T C CT ATT C C A 


AAGAGGAT CA 


TV ^^ITl^^ TV TV 

GACT GAACT G 


3360 


30 


GAGACCTGTC 


TGCAATTTTT 


CAAACGCTTG 


GAGAAGAATG 


GTCAGATCGA 


GATCTACTAT 


3420 




GATAAGTTGA 


CTAAGTTTGA 


AACACCTATT 


CACCCTGAAA 


TAAGAAAGCG 


TATTGTCGAA 


3480 




GCCGACTGTA 


TAATCGCTTT 


GATCAGCCAA 


CGCTATCTGG 


CCACGGATTA 


CATCCTGGAT 


3540 




CATGAGTTGC 


CTGTATTTCG 


GGAGTATAAC 


AAGACCATAG 


TGCCGATATT 


GATCAAGCCT 


3600 




TGTACATTCG 


AAGACGATGA 


GTTCCTTCGG 


GAGAAATATT 


TTGCTCAGAA 


AGCTCAAATA 


3660 


35 


ATCAATCTTG 


GAAAAGAGGG 


AAAAACCATT 


AAAGCTTATG 


ATAGTATTAC 


GGCATCAGCC 


3720 




CATCGTGATG 


AAAATTGGGT 


GGCAGTAGTC 


AGAGAGTTCA 


AAGAGAAGAT 


ATTAAGAATA 


3780 




ACAAAACAGG 


AGGTAAATAC 


AGATGAA 








3807 



40 



(2) INFORMATION FOR SEQ ID NO: 3

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1131 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1131

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3





TTGTTTAATC ATAAAAAATC ATGGTATGAA ACATTTCAAT TTTATCTCGT TGTTTTCCGC 60

TCTGGCTTTA TTCTTTTGTG TGAAAATACC CTTGCACAAC AAAAAACAGA GGAGTTTGCA 120

CCTGTGTCGG ATTTACGTGC AGAAGCGTAC GGCTCTACCG TTTTCCTCCA CTGGACTCCG 180

CCGTATGACA ATCCGATGAT TCCTCTAAGC GAGAGTTTTG AATCAGGTAT TCCAGCTATA 240

TGGAAGACCA TTGACGCAGA TGGCGATGGC TATAATTGGA TGCATTTGAC CAATTTCACG 300

GGACAGAGTG GTCTCTGTGT CTCTTCGGCT TCATACATAG GCGGCGTCGG AGCTTTGACT 360

CCGGACAATT ATCTGATAAC ACCCGAATTA AAACTACCCA CAGACGCGTT GGTGGAAATA 420

ATCTATTGGG TATGTACTCA AGATCTCACT GCTCCATCGG AGCACTATGC CGTTTATTCC 480

TCTTCTACAG GCAATAATGC TGCTGACTTT GTTAATCTCT TATATGAAGA GACTTTGACT 540

GCCAAACGGA TACAATCCCC CGAGTTGATC CGCGGAAATC GGACACAAGG TGTTTGGTAT 600

CAAAGAAAGG TGGTACTCCC TAACGATACT AAATATGTTG CTTTCCGCCA TTTTAATTCC 660

ACGGATAATT TCTGGCTCAA TTTGGATGAA GTATCTATCC TGTATACCCC TCTTCCCCGA 720

AGAGCTCCGT GTCCGCATCC GGGTGGTTAC ACTTATTCTG TATTCCGTGA TGGACAAAAG 780

ATAGCGAGTG GATTGTCGGC ATTGGCATAT ATCGATACGG ATGTACCGTA TGGGACTCAA 840

GACTATTGTG TCCAAGTCAA TTATCTGCAA GGAGACTCGT ATAAAGTCTG CAAAAATATA 900

GTGGTGGCAA ATTCTGCAAA CATCTATGGG GCGGATAAGC CTTTTGCGTT GACCGTGGTT 960

GGCAAGACCA TTGTAGCGAG TGCTTTCAAA GGAGAGATCA CTCTTTATGA CATTCGTGGC 1020

CGGCTGATAG CTTCCGGCTG CGATACGCTT AGGTACAAAG CGGAAAATGG TTTTTACCTC 1080

ATTAAAATAC AGGTAAACGG AACTGTCTAT ACTGAGAAAA TCCAAATCCA A 1131



(2) INFORMATION FOR SEQ ID NO: 4

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2289 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...2289

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4



CATTTTTTAG TAACGATTAT GCGGACAAAA ACTATCTTTT TTGCGATTAT CTCTTTTATT 60

GCTCTATTGT CGTCTTCTCT GTCGGCTCAG AGCAAAGCCG TTTTAACCGG TAGTGTGTCG 120

GATGCCGAAA CCGGAGAGCC TCTTGCCGGT GCTCGAATCG AAGTCAAACA CACCAACATA 180

GTAGCCGGTG CCGATGCCGG CGGACATTTC GAGATCAAGA ACCTGCCGGC AGGGCAGCAT 240

ACTATTATAT GTTCGTTGGG GGGGTATGGA CAGAAAGAGG AGGTGGTTGC CATCGAAGCC 300

GGACAGACCA AAACGATCTC TTTTGCATTG CGACTGCGAA CGAACAACTT GGAGGAAGTC 360

GTCGTTACCG GTACCGGTAC ACGTTACCGC TTGGTCGATG CTCCTGTGGC AACGGAAGTC 420

CTTACCGCTA AGGACATAGC CTCTTTCTCG GCTCCTACTT CCGAGGCCTT ATTGCAGGGG 480

CTGAGTCCGT CTTTTGACTT CGGCCCCAAT CTGATGGGCT CTTTCATGCA GCTGAACGGC 540

CTTAGCAGTA AGTATATCCT CATCCTTATC GATGGTAAGC GTGTGTACGG CGATGTAGGC 600

GGTCAGGCCG ATTTGAGTCG TATTTCTCCT GATCAGATCG AACGGATCGA ACTGGTGAAA 660

GGTGCTTCGA GTTCGCTCTA CGGATCCGAT GCCATCGCCG GGGTAATCAA TGTGATCACA 720

AAAAAGAATA CGAATCGACT GAGTGCATAT ACGTCACATC GCATATCGAA GTACAACGAT 780

CGGCAAACCA ATACTTCGCT CGATATAAAC ATCGGTAAGT TCAGTAGCAA TACCAACTAT 840

TTCTTCTACC ATACGGATGG CTGGCAGAAT AGTCCGTTCG AAATAAAAAA GAAAAAAGGA 900

TCCGGCGAAC CGGTCTTGGA GGAAACGTAT AAGAAAACTT TTCGTGCACA GGAAAATCAG 960

GGTGTAAGCC AATCGCTTTC CTATTATGCA ACTAACAATC TTAGCTTCAG CGGAAATGTG 1020

CAGTACAATA AACGTCAGAT CTTCACTCCG ACTTTTTCCG AAAAGAAGGC CTATGACATG 1080

GATTATCGTG CTTTGACGGC TTCACTCGGT ACGAACTATC TTTTCCCCAA TGGTCTGCAT 1140

ACGCTTTCTT TCGATGCCGT CTACGATCGC TTCCGTTTCG GATATTTGTA TCATGACAAG 1200

GACAGCAGTG AGAGCCTGAT CAACAACCAA GGTCAGACCG AGCAACCCAC ATTCTTTCCG 1260

GGTCAGCTAC GCAATAAAAA CGATCAGATC CGATACACGG CAGAGGCTCG CGGTGTATTT 1320

ACACTGCCTT ATGCGCAGAA ACTGACCGGC GGTTTGGAGT ATTTCCGTGA GGAATTGATC 1380

TCTCCCTATA ATTTGATTAC CGACAAGGCA GATGCTTCCA CGCTCTCTGC TTATGTACAA 1440

GATGAATGGA AACCGCTCGA TTGGTTCAAT ATGACAGCCG GTTTCCGTCT GGTACACCAT 1500

CAGGAGTTCG GTACACGAAT GACGCCTAAG GTATCCATAC TCGCCAAGTA TGGGCCGCTG 1560

AACTTCCGCG CTACGTATGC TAACGGCTAT AAGACTCCCA CGCTGAAAGA GCTTTTTGCA 1620

CGGAACGAAC TCACCACTAT GGGTTCGCAC AATCTCTATC TCGGCAATGC GGATCTTAAG 1680

CCACAGATGT CGGATTATTA TGCTTTGGGC TTGGAGTACA ATCAAGGCCC TATCTCGTTC 1740

AGTGCAACGG TTTATGACAA TGAACTTCGC AATCTGATCT CCTTTATGGA TATACCGACC 1800

TCACCCGAGC ACGAAGCTCA GGGAATCAAG AAAACCAAGC AGTATGCCAA CATAGGAAAA 1860

GCTCGCAGCC GCGGCCTTGA TGTCCTATGT GATGCCTCTA TCGGTTGGGG TATCAAGTTA 1920

GGAGCCGGAT ACAGCCTCGT GGAAGCTAAG AATCTCCAGA CGGATGAGTG GCTGGAAGoA 1980

GCTGCACGTC ATCGTGCCAA TGTGCACGCC GATTGGGTTC ACTACTGGGG TCAGTATAGA 2040

CTTGGCGTGA GCCTTTTCGG CCGTATTCAG AGCGAGCGTT ACTACAAAGA CGGCAATGCT 2100

CCGGACTATA CCTTGTGGCG ACTCGCCACA TCGCATCGTT TCGCTCATTT CCGCCACATC 2160

ATCCTGGATG GAACGCTCGG TATAGACAAC CTGTTTGACT ACGTGGATGA TCGTCCTATG 2220

GGTGTCAATT ATGCTACCGT AACGCCGGGA CGTACTTTCT TTGCTCAAAT AGCGATTCGA 2280

TTCAACAAC 2289



(2) INFORMATION FOR SEQ ID NO: 5

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1746 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1746

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5





TCAAAAATAG TACTACGAAA GTTTTGTACC TTAGCGCGCA TGAAAAAGAC CAATCTGTTT 60

TTATCTCTGC TGGTGATCTT TATCACCGGT AGTTTTATGA CTGCCTGTGC ACAGT^lAGTCC 120

AAGACGAACA AACTCACCGA AGAAGATCGG AGCCGCAATG AGTATGTACA GTCGATGGAT 180

GTGCTTAGCA ATATTATCGG TAACGTCAGG CTGTATTTCG TCGATACCAT AAGTATCAAA 240

CATATGACTC GGCGTGGTAT AGATGCGATG TTGGGCGGGC TTGACCCCTA TACCGAATAC 300

ATTCCTTACG AGGAAATGGA TGAACTGAAA TTGATGACTA CGGGAGAGTA TGCCGGAGTC 360

GGAGCTATCA TATCGCAGCG CCCGGATAGT GCTGTGATTA TCCAGAGACC TATGGAAGGT 420

ATGCCCGCAG ACGAAGCAGG ATTGATAGCA GGCGACCGCA TCCTGACTAT CGATGGGAAA 480

GACTTCCGCA AATCCACCAC ACCGAAAGTA AGCCAAGCAC TGAAAGGGAT AGCCGGTACT 540

GTTGCAAAGG TGACAGTAAT GCGCTATGGC GAAACCAAAC CTCGTACTTT TTCCGTGAAA 600

CGTCAAAAAG TGATTATGAA TTCCGTCACT TACAGCGGAA TGCTCGATGG CTCGATAGGA 660

TATATCCGCT TGAACAACTT TACGGACAAA AGTGCAGAAG AGGTGCGCAC GGCCTTGTTG 720

GATCTTCGTG ACAAACAAGG AGCGAAAGGT CTCATTTTGG ATTTAAGAGG CAATGGTGGC 780

GGACTGATGC AGGCTGCTAT CGAGATAGTC AATCTGTTCG TCCCTAAGGG CAAAGAGGTG 840

GTAACGACCA AAGGTCGCAT TGCAGAGTCG GCGTCCGTAT TTCGCACATT GACTGAACCG 900

ATCGACACGA AACTCCCGAT AGTAGTCCTG ATCGATGGAC AATCGGCATC TTCCTCGGAG 960

ATTGTAGCCG GAGCACTGCA GGATATGGAC AGGGCTGTAC TGATGGGACA AAAGAGCTAT 1020

GGCAAAGGGC TTGTACAAAC GACTCGTCAG CTACCATACA ACGGCGTGAT CAAATTGACT 1080

ACGGCCAAGT ACTACATCCC AAGCGGACGT TGTATTCAGC GTTTGGACTA CAGCCGCACC 1140

AATCGGACAG GTATGGCAAC GGCCATTCCT GACAGTCTGC ACAAAATCTT TTACACTGCT 1200

GCCGGAAGAC GTGTAGAAGA TGCAGGAGGA ATCCTGCCTG ACATCGAGGT CAAACAAGAT 1260

ACAGCTGCGA CATTACTTTA TTATATGGCC ATCAATAATG ACGTTTTCGA TTTCGTCACA 1320

GGTTATGTGC TCAAGCATAA AACGATTGCC AAGCCGGAGG ATTTTTCCAT AACGAACGAG 1380

GACTATGCAG CTTTCTGCAA GATGATGGAA GAAAAGAAAT TTGACTATGA TCGCCAGAGT 1440

GGCAAGATGC TTGACAAACT GGAGGAACTG GCTAAGATAG AAGGCTACCT GCCGGAAGCC 1500

AACTCGGAGC TTAAAGCACT ACGCGAAAAG CTAAAACCCA ACCTGTCGCG TGATCTGCTA 1560




CGATTCAAAA 


AGGAGATAAC 


AAACTATCTC 


AACAATGAGA 


TTGTCACTCG 


CTATTATTAT 


1620 




GAGCGAGGCA 


GTATCCGCCA 


GAGTTTGCCG 


GAAGATAAGG 


TAGTCAAAGA 


AGCTATTAAG 


1680 




CTGCTGAAGG 


ACCATCCGGA ACAAATTCGA 


CAGATCCTTG 


CAGCTCCGAA AGCAGAGAAT 


1740 


40 


AAAGGG 












1746 



(2) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 



ACGAAGAAAA CACTGATGAA AAAGCTATTT CTCTCGCTCA CGAGTCTTGT AATGGTCTTC 60 

GCTGTTGCAA GTTGCGATAT AATCGACAAG GATCAAACCC TCTTGCCGGC TCCGACCAAT 120 

GTGACACCCG ATAATCCGGA TGACAATCCT TCGGAGATCG ACATTACGCA GACGCACACA 180 

GAAAAATATG TTTTGGCTGA AGAATTTACC GGCCAAAAAT GTCTCAACTG TCCGAAAGGT 240 

CATCGCAAAC TGGCGGCTCT CAAGGAGCAA TACGGTAAGA GATTGACTGT TGTCGGTATA 300 

CATGCCGGCC CTGGATCTCT CGTGCCACCT CTTTTCCGTA CAGAAGCCGG AGACGCATAT 360 

TATAGCAAGT TCGCCAATAA TACCCCTCTC CCTGCGCTGA TGGTTTCGCG CAAAAAGTTC 420 

GGCTCTTCCT ACGTTTATGA TAAGAGCTAC AAAACGTGGG ACGTGCCTAT TGCCGAGCAG 480 

ATGGAGCAAA AGGCGAAGAT CAATATCTTT GCCGTGGCCG AATACACCGA TACCCAAAAG 540 

ATCAAGGTGA CTGTAAAGGG TAAAATACTG GAGGGGAATA CACTCCCGAA GTCCATGGTT 600 

CAGGTGTATC TGTTGGAGGA TAAGCTGATC GCTCCGCAGG TGGATGGCAA TACGACAGTC 660 

GAGAATTACG AGCACAATCA CGTGTTGCGT GGAGCCGTTA ATGGTATTTG GGGCGAAGAA 720 

TTTGTGAATC TCAAAGATTA TTTGTATACT TACGCCGTTG AACCGCTCTC GGGTATGTCC 780 

TTCGTAGCCG AGAATTATTC GATTGTGGCT TTTGTATACG ATGTGCAGAC GTTCGAAGTG 840 

TATGACGTTG TGCATGTAAA GATCAATCCG CAATCCGATG GCAAA 885 



(2) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...3138 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 



GACTTCCCTT GGGTTAGGGT AAAACCGGAG AAGAAAAGAA AACAACATAA CAGTAATAAT 60 

TTTAAGTTTA ACGCAAAAGA AAAGTCTATG AAAAGAATGA CGCTATTCTT CCTTTGCTTG 120 

CTGACGAGCA TTGGGTGGGC TATGGCCCAG AATAGAACCG TGAAGGGTAC AGTTATCTCC 180 

TCCGAGGATA ATGAGCCCCT GATCGGCGCG AATGTCGTGG TTGTCGGAAA CACCACTATC 240 

GGTGCTGCAA CCGACTTGGA TGGCAACTTC ACGCTTAGCG TGCCTGCCAA TGCCAAAATG 300 

TTGAGAGTGT CCTATTCCGG TATGACTACC AAAGAGGTCG CCATCGCTAA TGTGATGAAG 360 

ATCGTACTGG ATCCGGACTC TAAGGTTCTG GAGCAGGTAG TTGTATTGGG TTACGGTACG 420 

GGACAGAAAC TCAGCACTGT TTCCGGTTCT GTGGCCAAAG TGTCCAGCGA AAAGCTCGCG 480 

GAAAAGCCCG TTGCCAATAT CATGGATGCC CTCCAAGGTC AGGTAGCCGG TATGCAGGTT 540 

ATGACTACAT CCGGTGACCC TACTGCCGTC GCTTCTGTGG AGATCCATGG TACAGGGTCG 600 

TTGGGGGCAA GCTCTGCACC ATTGTATATC GTGGATGGTA TGCAAACTTC TTTGGATGTT 660 

GTGGCTACGA TGAATCCGAA TGATTTTGAA TCTATGTCCG TTTTGAAAGA TGCTTCTGCA 720 

ACATCTATTT ATGGAGCTCG TGCTGCAAAC GGAGTCGTTT TCATTCAAAC GAAGAAAGGT 780 

AAAATGAGCG AGAGAGGTCG TATTACCTTT AATGCCAGTT ACGGGATTTC TCAAATCCTG 840 

AATACTAAGC CCCTTGATAA TATGATGACT GGAGATGAAT TGCTGGATTT TCAGGTGAAG 900 



GCAGGTTTTT GGGGGAACAA TCAAACCGTT CAGAAGGTTA AAGATATGAT CCTTGCCGGA 960 

GCTGAAGATT TGTATGGCAA TTATGATTCT TTGAAAGATG AGTATGGTAA GACATTGTTC 1020 

CCAGTGGATT TTAATCATGA TGCAGACTGG CTCAAGGCTT TGTTTAAAAC AGCACCCACC 1080 

AGTCAAGGTG ATATTTCTTT CTCCGGAGGG TCTCAGGGAA CTTCATATTA TGCCTCTATA 1140 

GGCTACTTCG ATCAGGAAGG TATGGCTCGT GAACCGGCAA ATTTTAAGCG CTATAGTGGC 1200 

CGGCTCAACT TCGAAAGTCG TATCAATGAA TGGCTGAAAG TTGGTGCAAA TTTGTCTGGT 1260 

GCGATAGCGA ATAGACGATC TGCCGACTAT TTTGGAAAGT ATTATATGGG GTCAGGTACT 1320 

TTCGGTGTGT TAACGATGCC TCGTTATTAT AACCCTTTTG ATGTGAATGG GGATTTAGCA 1380 

GATGTCTATT ACATGTATGG AGCTACCAGA CCTTCTATGA CAGAACCGTA CTTCGCAAAA 1440 

ATGAGACCGT TCAGTTCCGA ATCACATCAG GCCAATGTAA ATGGTTTCGC CCAGATTACT 1500 

CCGATCAAAG GCCTTACTTT AAAGGCACAG GCTGGTGTTG ATATTACTAA TACTCGCACT 1560 

TCTTCTAAGA GAATGCCCAA TAATCCGTAT GATTCTACTC CTCTTGGGGA AAGAAGAGAA 1620 

AGAGCTTATC GAGATGTTAG CAAGTCTTTT ACAAATACGG CTGAATATAA GTTTTCAATT 1680 

GATGAAAAAC ATGATCTTAC AGCATTGATG GGGCATGAAT ATATTGAATA TGAAGGGGAT 1740 

GTTATTGGGG CATCTTCTAA AGGATTTGAA AGTGATAAGT TGATGTTACT GAGCCAGGGA 1800 

AAAACCGGAA ATAGTTTGTC TTTGCCTGAA CACAGAGTCG CTGAATATGC CTATTTGTCT 1860 

TTCTTTAGTC GTTTTAATTA CGGTTTTGAC AAATGGATGT ATATAGATTT CTCTGTTCGT 1920 

AATGACCAAT CCTCTCGATT CGGATCCAAT AATAGAAGCG CGTGGTTCTA TTCTGTCGGT 1980 

GGAATGTTTG ACATATATAA TAAATTCATT CAAGAAAGTA ATTGGCTCAG TGATCTTCGA 2040 

CTGAAAATGA GTTATGGTAC AACGGGTAAC TCGGAGATTG GTAATTACAA CCACCAAGCA 2100 

CTCGTTACTG TGAACAATTA TACTGAAGAT GCTATGGGGC TTAGCATTTC TACAGCAGGC 2160 

AATCCCGACC TCTCGTGGGA AAAGCAGTCT CAGTTCAACT TCGGTTTGGC TGCAGGGGCT 2220 

TTCAATAATC GCTTATCTGC AGAGGTAGAT TTCTATGTCC GCACTACGAA TGATATGTTG 2280 

ATTGATGTCC CGATGCCTTA TATCAGTGGT TTCTTCTCAC AGTATCAGAA TGTAGGCTCT 2340 

ATGAAAAATA CGGGTGTAGA CCTTTCTCTT AAGGGGACGA TCTACCAAAA TAAGGACTGG 2400 

AATGTATATG CTTCTGCGAA TTTCAACTAC AATAGACAGG AAATAACAAA GCTTTTCTTC 2460 

GGTCTCAATA AGTACATGTT GCCTAATACC GGTACTATAT GGGAAATTGG GTACCCCAAT 2520 

TCGTTCTATA TGGCTGAATA TGCTGGAATC GACAAAAAAA CCGGTAAGCA GTTGTGGTAT 2580 

GTTCCTGGTC AAGTCGATGC GGATGGTAAT AAAGTTACAA CAAGCCAGTA CTCAGCTGAC 2640 

TTGGAGACAC GAATTGATAA GTCTGTTACT CCTCCTATTA CAGGTGGTTT CTCCTTAGGT 2700 

GCTTCTTGGA AAGGACTTTC TTTAGATGCT GATTTTGCCT ACATCGTTGG TAAATGGATG 2760 

ATCAATAATG ACCGTTACTT TACAGAGAAT GNAGGTGGAT TGATGCAATT AAATAAAGAT 2820 

AAAATGCTAT TGAATGCCTG GACAGAGGAT AATAAAGAAA CAGATGTTCC AAAATTGGGA 2880 

CAGTCTCCTC AGTTTGATAC GCATTTGTTG GAGAATGCTT CTTTCCTGCG TTTGAAGAAT 2940 

CTCAAACTCA CCTATGTACT CCCCAATAGT CTTTTTGCTG GGCAGAATGT GATTGGTGGA 3000 

GCTCGTGTCT ATTTGATGGC GCGCAATCTG TTAACTGTTA CGAAGTATAA AGGCTTTGAC 3060 

CCTGAAGCAG GGGGGAATGT GGGAAAAAAT CAATATCCTA ATTCTAAGCA GTACGTTGCG 3120 

GGTATTCAGT TGTCTTTC 3138 



(2) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2583 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 

TTAGAGATGG CATACGACTT TACACAAACA TTCCGCAACA GCCTGGAGTA CAGCTATCAG 60 

GAAGCAACCC GTCTCGGCGT CGTAGCCGTG ACGCAAGATA TGCTCGTACT CGGTATCATT 120 

CGCGACGGAG ACAATGGCGC GATCGACATC ATGCGGCACT ATGGGATCAA CTTGTACGAA 180 

CTCAAACGGT TGATCGAGTT GGAAGCCATC GCCGAGAGTT TGCCTGCTTC GCCTGAGGGA 240 

TCGCCCATCT TCACCCCTTC GGCTCGGGAG GCTATCGATG ATGCCACAGA CATCTGTGCC 300 

GACATGGAGG ACGAGGCCGT CAGCCCGGTC CATCTGTTGC TGAGTATCCT CAACTCGACA 360 

CAGGAGAGCT TAGTACAAAA GATATTTATG AAACAAGGTA TAAAATACGA CACCATCCTG 420 

TCGGATTACT TCGGACAGCG CAACCCCTCC GAAGGGAAGT CTCCCTCCGA AATGGAGATC 480 

CTCGACGGGT ACCAAGACAA CGACTTCGAC GACGAAGAGG ACGAATCCTC TCCGCCTTCC 540 

GGGAATAGCG GGACAGGCGG AGGCTCCGGC GACGCCCCCG AACAGAATAC CGGCGGAGGC 600 

GATACTACCA CCACGACACG GAGTGGAGGC GACACGCCTG CACTGGACAC CTTCGGCACC 660 

GACATCACTG CCATGGCGGC AGCAGGCAAG CTCGACCCGG TAGTGGGTCG GGAGCAGGAG 720 

ATCGAAAGGG TGATACAGAT ACTCAGCCGG CGCAAAAAGA ACAATCCGGT GCTCATCGGC 780 

GAACCCGGTG TAGGCAAGAG TGCCATCGTG GAAGGACTGG CCGAACGCAT CGTGAACAGG 840 

AAGGTGAGCC GTATTCTTTT CGACAAGCGG ATCATCAGCC TCGATTTGGC TCAGATGGTA 900 

GCCGGCACCA AATATCGCGG ACAGTTCGAA GAGCGGTTGA AAGCCGTGCT CGATGAGCTG 960 

AAGAAGAATC CGCAGATCAT CCTCTTCATC GACGAGATAC ATACCATCGT GGGAGCAGGC 1020 

TCTGCAGCCG GATCGATGGA TACGGCCAAT ATGCTCAAAC CCGCTCTTGC CCGTGGACAG 1080 

GTACAGTGCA TCGGAGCCAC TACGCTGGAT GAGTATCGTA AGAACATAGA AAAGGACGGA 1140 

GCACTCGAAC GCCGCTTCCA GAAGGTGCCG ATAGCCCCCT CGACTGCAGA AGAAACGCTG 1200 

ACCATCCTGC AAAACATCAA AGAGAAATAC GAGGACTATC ACGGTGTACG CTATACGGAC 1260 

GAAGCGATCA AAGCGGCAGT GGAACTGACC GATCGCTATG TATCCGATCG TTTCTTCCCA 1320 

GATAAGGCGA TAGATGCCAT GGACGAGGCC GGCGCGAGCG TCCATATCAC CAATGTGGTG 1380 

GCTCCGAAAG AAATCGAGAT ACTGGAGGCC GAATTGGCAT CGGTGCGAGA GAACAAGCTC 1440 

TCGGCCGTAA AGGCTCAGAA CTACGAACTG GCTGCCTCCT TCCGCGATCA GGAGCGGCGC 1500 

ACTCAGCAGC AGATAGCGGA AGAGAAGAAA AAATGGGAAG AGCAGATGTC CAAGCACCGC 1560 

GAGACGGTGG ACGAGAATGT AGTGGCGCAT GTAGTGGCGT TGATGACAGG CGTTCCGGCT 1620 

GAGCGGCTGA GCACGGGCGA AGGCGAACGT CTGCGCACGA TGGCAGATGA TCTCAAGACC 1680 

AAAGTAGTAG GTCAGGACAC AGCCATCGAA AAGATGGTGC ATGCCATCCA GCGCAATCGT 1740 

CTGGGACTTC GCAATGAAAA GAAACCGATC GGTTCTTTCC TTTTCCTCGG CCCCACGGGG 1800 

GTAGGCAAGA CCTATTTGGC CAAGAAGCTC GCCGAATACC TGTTCGAGGA TGAGAATGCC 1860 

ATGATCAGGG TGGATATGAG CGAGTATATG GAGAAGTTCT CCGTTTCGCG TCTCGTGGGT 1920 

GCCCCTCCGG GATATGTGGG CTATGAAGAA GGCGGCCAAC TGACGGAGCG CGTAAGACGC 1980 

AAACCCTATT CCGTGGTTCT CTTGGATGAG ATCGAAAAGG CGCATGCCGA TGTCTTCAAT 2040 

CTGCTCTTAC AGGTGATGGA CGAAGGTCAG CTGACCGACA GTCTGGGACG GCGCGTGAAT 2100 

TTCAAGAACA CCGTGATCAT CATCACCTCC AACGTGGGTA CACGCCAGCT CAAAGACTTC 2160 

GGGCAGGGTA TCGGGTTCCG TTCGGAAAAA GACGAGGAAG CGAACAAGGA GCATAGCCGT 2220 

TCCGTGATCC AAAAAGCTCT GAACAAGACG TTCAGCCCCG AATTTCTCAA CCGTTTGGAC 2280 

GATATCATCC TCTTCGACCA ACTGGGCAAG ACGGAGATTC GCCGGATGGT GGACATAGAG 2340 

CTTAAAGCCG TCTTGGCGCG CATCCATCGT GCCGGATACG ACCTCGTCCT TACCGATGAA 2400 

GCCAAGGATG TGATAGCGAC GAAGGGATAC GACCTCCAAT ACGGAGCACG ACCGCTCAAG 2460 

CGCACACTCC AGAACGAAGT GGAGGATCGC CTCACGGATC TTATCCTCTC CGGACAGATC 2520 

GAGAAAGGGC AGACGCTTAC GCTCTCTGCT CGCGATGGCG AGATCATCGT ACAAGAACAA 2580 

GCA 2583 

(2) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...897 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9 



TTAAAAAAGG AGATAACTAT GAAACAGAAC TACTTCAAAA GAGTCTGCTC ACTGCTTTGG 60 

CTGGTTTTAC CCATGCTTAT TATGCCATTG GAAGTAGCAG CTCAAGAGAT TATTCCGAAC 120 

GAAGAGGTGT TGGAATCATT GACTTTCGTT GCACCGGTTG AGGAGACAGA CGCAATAGAG 180 

GCAGAGGTAG AAGCTCTGCA GGAGATAGTC GCTACTGAGG AGATTGCGGA GCAGGCTGTT 240 

CGTTCTTATA CCTACACGGT CTATCGTGAT GGCGTGAAGA TTGCTTCAGG ATTGACTGAG 300 

CCCACTTTTC TCGATGAAGA TGTTCCTGCC GGCGAACATA CCTACTGCGT AGAAGTACAG 360 

TATCAGGGAG GCGTATCCGA CAAAGTATGC GTGGACGTAG AGGTGAAGGA CTTCAAACCG 420 

GTTACCAATC TCACCGGAAC TGCTTCCAAT GACGAAGTTT CTTTGGACTG GGACGGTGTG 480 

GAAGAGAAAG CTGAAGAGCC GGCAAGTGAT AAAGCAGTCA GCTACAACGT CTACAAGAAT 540 

GGAACCTTGA TCGGTAATAC AGCTGAAACT CATTATGTGG AGACCGGTGT AGCCAATGGT 600 

ACATACATCT ACGAAGTGGA AGTAAAGTAT CCTGACGGTG TATCTCCGAA GGTGGCTGTA 660 

ACCGTGACCG TGACCAACAG CTCATTGAGC AATGTAGATG GACAGGCTCC TTACACATTG 720 

CGAGTAGAAG GCAAGAAGAT TATTGCGGAA GCCCATGGTA TGATCACGCT CTACGACATC 780 

AACGGACGTA CCGTGGCCGT AGCCCCGAAT CGATTGGAAT ACATGGCGCA AACCGGTTTC 840 

TATGCAGTGC GCTTCGATGT GGGGAATAAA CACCATGTAT CGAAAATACA AGTAAGA 897 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1392 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 



TACGGCAAAA GAAGAAAGCT CGGAACCTCC GTCCGACCTT CCGTTCTAAC CCAAATCAGA 60 

TTTATTTTGG ACTTACACCT TATTACTGAT TTTTTTGAAG GGCTTCGGGT CAATCCTATC 120 

GGTGCAGCAG CCATAGTGGC TTTCATTATC GACCTGCTTC TTCTTTGCTG TTCGGCTTTT 180 

ATGTCCTCCT GTGAGGTGGC TTATTTTTCA CTAAAGCCGA TCGATCTGCA GAACATCCGC 240 

GAACGGAATC ACTCTTCCGA CATCGCGCTT TCCAATTTAT TAGACAATTC GAATCAGCTA 300 

TTAGCTACTA TTCTGATCGG GAATAATGTG ATTAATGTAG CCATCGTTAT CCTTTCCAAT 360 

TATGCCATCG AGCAGACATT CGTTTTCTCT TCTCCGATCA TTGGATTTCT GATCCAGACG 420 

ATACTCCTGA CCACTGTTCT TTTGCTGTTC GGAGAGATTC TGCCGAAAGT GTATGCGCGG 480 

AAGAATCCGC TGCAATACTC GCGCTTTTCT GCTGCAGCTA TGTCCGTTAT CTATAAGATA 540 

TTGTCACCGT TTTCAAAATT GCTGGTCAAA AGTACCGGCA TCGTTACCAG AGGTATCAGC 600 

AAGAAGAAAT ACGATATGTC CGTGGATGAG CTCTCGAAAG CGGTAGCCCT CACCACTACG 660 

GAGGGAGAGC CGGAGGAGAA AGAAATGATT AACGAAATCA TCAAATTCTA TAATAAGACA 720 

GCCTGCGAAA TCATGGTTCC GCGTATCGAT ATTGTGGATG TGGATCTGAG CTGGCCATTT 780 

CGTAAGATGC TTGACTTCGT TGTTTCGTCG GGTTATTCCA GACTTCCCGT TTCAGAGGGG 840 

TCAGAAGACA ATATCAAAGG GGTGATTTAC ATCAAAGATC TAATCCCACA CATGGATAAA 900 

GGCGATGAAT TCGACTGGCA TCCTCTGATT CGTAAAGCAT ATTTTGTCCC CGAAAACAAG 960 

CGCATAGATG ATTTGCTCGA GGAGTTCAGA GCCAATAAGG TGCATGTCTC CATCGTTGTG 1020 

GATGAGTTCG GTGGCACTTG CGGACTGATC ACAATGGAGG ACATATTGGA AGAGATCGTC 1080 

GGCGAGATTA CGGACGAGTA CGATGAGGAA GAACTCCCCT TTAAGGTTTT GGGGGATGGC 1140 

AGTTATCTTT TCGAAGGAAA AACGTCTCTC TCCGATGTTC GACACTATCT TGACCTTCCG 1200 

GAAAATGCTT TCGGTGAATT GGGGGACGAG GTAGATACGC TAAGTGGGCT CTTCTTGGAA 1260 

ATCAAGCAGG AACTCCCCCA TGTGGGCGAT ACAGCAGTGT ACGAGCCATT CCGCTTTCAA 1320 

GTGACCCAAA TGGACAAGCG CCGAATCATC GAAATCAAGA TTTTCCCTTT CGAGCGCACT 1380 

TGGGAGGTCG AA 1392 



(2) INFORMATION FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...2607 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

TGGCATAGGA ATATTTTTAT CTTTGCGAGT ACATTTAGCC CGAAAAATAT GCTCCCACTG 60 

CCATACCGTT ATGCAAAAAC CGAGCACCTT TTTCTCGCAA AAGGATACTG CAAGAATCCA 120 

ATAACAAACA TAATTATCCT ATTTATGAAG AAAAAGAATT TTTTGCTTCT TGGCATTTTC 180 

GTTGCTTTGC TGACTTTCAT CGGCAGCATG CAGGCACAAC AGGCCAAAGA TTATTTCAAC 240 

TTTGACGAAC GGGGCGAGGC CTACTTCTCA TTCAAAGTGC CTGATAGGGC CGTTCTACAA 300 

GAGCTGGCTC TGATCATGTC CATCGACGAG TTTGACCCCG TAACCAATGA AGCCATTGCC 360 

TATGCCAGCG AAGAGGAGTT CGAGGCATTC CTGCGCTATG GGCTCAAGCC TACATTCTTG 420 

ACTCCTCCAT CCATGCAGCG CGCTGTCGAG ATGTTCGACT ACCGCTCAGG AGAAAAATAC 480 

GAATGGAATG CTTACCCCAC CTATGAAGCC TATATCAGCA TGATGGAAGA GTTCCAAACA 540 

AAGTATCCAT CACTTTGTAC TACTTCCGTC ATTGGCAAGT CCGTAAAGGA TCGTAAACTG 600 

ATGATTTGCA AGCTGACGTC CTCTGCCAAT ACAGGGAAAA AGCCTCGCGT GCTCTATACT 660 

TCTACGATGC ACGGAGACGA AACGACCGGA TATGTGGTAC TGCTCCGACT CATAGACCAT 720 

CTGCTGTCGA ACTACGAATC CGATCCGAGG ATTAAGAACA TTCTGGATAA AACGGAAGTA 780 

TGGATCTGCC CTTTGACCAA TCCGGACGGA GCATACAGAG CCGGAAACCA CACCGTACAA 840 

GGAGCTACTC GCTACAATGC CAACAATGTC GATTTGAACC GTAACTTCAA GGATGATGTA 900 

GCCGGTGATC ACCCCGATGG AAAACCTTGG CAGCCGGAGG CAACTGCATT CATGGATTTG 960 

GAAGGAAACA CCTCTTTCGT GCTCGGTGCC AATATACATG GAGGAACAGA GGTGGTGAAC 1020 

TATCCATGGG ATAATAAAAA AGAAAGACAT GCAGACGATG AGTGGTACAA ACTGATCAGT 1080 

CGCAACTACG CAGCCGCTTG TCAGAGTATT TCCGCCAGCT ACATGACCTC CGAAACC/^T 1140 



TCGGGAATCA TCAACGGTTC AGACTGGTAT GTAATTCGCG GAAGTCGTCA GGACAATGCA 1200 

AATTATTTCC ATCGTCTGCG AGAAATTACC CTTGAAATCA GCAACACGAA GTTGGTGCCG 1260 

GCCTCTCAAC TTCCAAAGTA TTGGAATCTG AACAAAGAAT CTCTGCTTGC TCTGATCGAA 1320 

GAATCCTTAT ACGGCATCCA TGGTACAGTG ACTTCCGCTG CGAACGGACA GCCTCTCAAA 1380 

TGCCAGATCT TGATAGAAAA CCATGACAAG CGCAACTCCG ATGTTTACTC CGATGCTACC 1440 

ACAGGCTACT ACGTACGTCC TATCAAAGCC GGCACTTATA CGGTGAAATA CAAAGCCGAG 1500 

GGTTATCCTG AGGCAACTCG TACCATTACG ATCAAGGACA AAGAAACCGT CATCATGGAC 1560 

ATTGCATTGG GCAACTCGGT TCCTCTGCCT GTACCCGATT TCACAGCTTC TCCTATGACC 1620 

ATCTCAGTAG GCGAAAGCGT CCAATTCCAA GATCAAACGA CAAATAACCC CACGAATTGG 1680 

GAGTGGACGT TCGAAGGCGG ACAGCCTGCC ATGAGTACAG AGCAGAATCC GCTCGTATCC 1740 

TATAGTCATC CCGGTCAGTA CGACGTTACG CTCAAAGTGT GGT^TGCAAG TGGTTCCAAC 1800 

ACGATTACGA AAGAAAAATT CATCACTGTC AATGCCGTTA TGCCTGTAGC TGAATTCGTC 1860 

GGTACCCCGA CGGAAATAGA AGAGGGCCAG ACGGTATCTT TCCAAAACCA ATCCACCAAT 1920 

GCCACCAACT ACGTATGGAT ATTCGATGGC GGCACTCCCG CTACCAGTGA AGACGAAAAC 1980 

CCGACTGTGC TTTACAGCAA AGCCGGCCAA TACGATGTCA CGCTCAAGGC GATCAGTGCT 2040 

TCCGGTGAAA CGGTGAAGAC GAAAGAAAAA TACATCACTG TCAAGAAAGC TCCGGTCCCT 2100 

GCTCCGGTAG CCGACTTCGA AGGAACACCT CGAAAAGTAA AGAAAGGCGA GACAGTTACT 2160 

TTCAAAGACT TGTCTACGAA CAATCCGACT TCATGGCTTT GGGTGTTCGA AGGCGGCTCT 2220 

CCTGCCACCA GCACGGAGCA AAACCCGGTG GTCACCTACA ATGAAACAGG CAAGTACGAT 2280 

GTCCAGCTGA CTGCCACCAA CGAGGGCGGA AGCAATGTGA AGAAAGCAGA AGACTACATT 2340 

GAGGTTATCC TCGATGACAG TGTCGAGGAC ATAGTGGCAC AGACGGGTAT CGTCATTCGT 2400 

CCGCAAAACG GAACGAAGCA GATCCTCATA GAAGCCAACG CTGCTATCAA AGCGATCGTT 2460 

CTCTATGACA TCAATGGACG GGTCGTACTC AAAACTACTC CGAATCAGCT CCGCTCGACC 2520 

GTAGATCTTT CCATCCTGCC CGAAGGAATC TACACCATCA ATATCAAAAC GGAAAAATCC 2580 

GCTCGCACGG AAAAGATCCA TATCGGG 2607 



(2) INFORMATION FOR SEQ ID NO: 12 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...849 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Phe Gly Ile Ser Pro Ser Met Lys Lys Ser Phe Leu Leu Ala Ile Val 
1 5 10 15 

Met Leu Phe Gly Ile Ala Met Gln Gly His Ser Ala Pro Val Thr Lys 
20 25 30 

Glu Arg Ala Leu Ser Leu Ala Arg Leu Ala Leu Arg Gln Val Ser Leu 
35 40 45 

Arg Met Gly Gln Thr Ala Val Ser Asp Lys Ile Ser Ile Asp Tyr Val 
50 55 60 

Tyr Arg Gln Gly Asp Ala Glu Arg Gly Ile Thr Ser Gln Glu Glu Gly 
65 70 75 80 

Ser Pro Ala Tyr Phe Tyr Val Ala Asn Arg Gly Asn Asn Glu Gly Tyr 
85 90 95 

Ala Leu Val Ala Ala Asp Asp Arg Ile Pro Thr Ile Leu Ala Tyr Ser 
100 105 110 

Pro Ile Gly Arg Phe Asp Met Asp Ser Met Pro Asp Asn Leu Arg Met 
115 120 125 

Trp Leu Gln Ile Tyr Asp Gln Glu Ile Gly Leu Ile Leu Ser Gly Lys 
130 135 140 

Ala Gln Leu Asn Glu Glu Ile Leu Arg Thr Glu Gly Val Pro Ala Glu 
145 150 155 160 

Val His Ala Leu Met Asp Asn Gly His Phe Ala Asn Asp Pro Met Arg 
165 170 175 

Trp Asn Gln Gly Tyr Pro Trp Asn Asn Lys Glu Pro Leu Leu Pro Asn 
180 185 190 

Gly Asn His Ala Tyr Thr Gly Cys Val Ala Thr Ala Ala Ala Gln Ile 
195 200 205 

Met Arg Tyr His Ser Trp Pro Leu Gln Gly Glu Gly Ser Phe Asp Tyr 
210 215 220 

His Ala Gly Ser Leu Val Gly Asn Trp Ser Gly Thr Phe Gly Glu Met 
225 230 235 240 

Tyr Asp Trp Ile Asn Met Pro Gly Asn Pro Asp Leu Asp Asn Leu Thr 
245 250 255 

Gln Ser Gln Val Asp Ala Tyr Ala Thr Leu Met Arg Asp Val Ser Ala 
260 265 270 

Ser Val Ser Met Ser Phe Tyr Glu Asn Gly Ser Gly Thr Tyr Ser Val 
275 280 285 

Tyr Val Val Gly Ala Leu Arg Asn Asn Phe Arg Tyr Lys Arg Ser Leu 
290 295 300 

Gln Leu His Val Arg Ala Leu Tyr Thr Ser Gln Glu Trp His Asp Met 
305 310 315 320 

Ile Arg Gly Glu Leu Ala Ser Gly Arg Pro Val Tyr Tyr Ala Gly Asn 
325 330 335 

Asn Gln Ser Ile Gly His Ala Phe Val Cys Asp Gly Tyr Ala Ser Asp 
340 345 350 

Gly Thr Phe His Phe Asn Trp Gly Trp Gly Gly Val Ser Asn Gly Phe 
355 360 365 

Tyr Lys Leu Thr Leu Leu Ser Pro Thr Ser Leu Gly Ile Gly Gly Glu 
370 375 380 

Gly Ile Gly Phe Thr Ile Tyr Gln Glu Ile Ile Thr Gly Ile Glu Pro 
385 390 395 400 

Ala Lys Thr Pro Ala Glu Ala Gly Thr Asp Ala Leu Pro Ile Leu Ala 
405 410 415 

Leu Lys Asp Ile Glu Ala Glu Tyr Lys Ser Glu Ser Gly Leu Asn Val 
420 425 430 

Gly Tyr Ser Ile Tyr Asn Thr Gly Glu Glu Gln Ser Asn Leu Asp Leu 
435 440 445 

Gly Tyr Arg Leu Asn Lys Ala Asp Gly Glu Val Ile Glu Val Lys Thr 
450 455 460 

Ser Ser Ile Asn Ile Ser Trp Tyr Gly Tyr Gly Glu His Pro Glu Ser 
465 470 475 480 

Phe Ser Leu Ala Pro Asn Gln Leu Ser Gln Gly Ile Asn Thr Ile Thr 
485 490 495 

Leu Leu Tyr Arg Arg Thr Gly Thr Glu Gln Trp Glu Pro Val Arg His 
500 505 510 

Ala Gln Gly Gly Tyr Val Asn Ser Ile Lys Val Asn Thr Thr Asp Pro 
515 520 525 

Asn Asn Val Val Val Thr Val Asp Asn Asn Glu Gly Lys Leu Ser Ile 
530 535 540 

Val Pro Asn Ser Phe Val Ala Asp Leu Asn Ser Tyr Glu His Ser Thr 
545 550 555 560 

Ile Thr Val Gln Phe Asn Ser Asp Ser Pro Asp Glu Ile Arg Thr Pro 
565 570 575 
Val Ala Phe Ala Leu Ser Thr Gly Ala Thr Ala Asp Asp Val Ile Ser 
580 585 590 

Leu Gly Trp Val Met Ala Glu Val Pro Gly Gly Ser Ser Asn Tyr Pro 
595 600 605 

Val Val Trp Ser Lys Asp Val Leu Thr Leu Ser Glu Gly Asp Tyr Thr 
610 615 620 

Leu Trp Tyr Arg Phe Ser Ile Asn Asn Gln Lys Asp Glu Trp Lys Lys 
625 630 635 640 

Ile Gly Ser Val Ser Val Lys Thr Pro Thr Glu Tyr Thr His Pro Leu 
645 650 655 

Phe Glu Val Gly His Asn Gln Thr Ser Thr Tyr Thr Leu Asp Met Ala 
660 665 670 

His Asn Arg Val Leu Pro Asp Phe Thr Leu Lys Asn Leu Gly Leu Pro 
675 680 685 

Phe Asn Gly Glu Leu Val Val Val Phe Arg Gln Thr Gln Ser Ser Ser 
690 695 700 

Gly Ser Leu Trp Ala Ala Gln Glu Thr Val His Ile Lys Gln Gly Glu 
705 710 715 720 

Thr Phe Val Tyr Lys Pro Val Val Glu Gly Pro Ile Pro Asp Gly Ser 
725 730 735 

Tyr Arg Ala Thr Leu His Ala Phe Val Asn Gly Gln Gln Gln Leu Tyr 
740 745 750 

Leu Lys Gly Lys Arg Asn Tyr Thr Val Lys Ile Val Asn Gly Thr Ala 
755 760 765 

Val Glu Ala Ile Glu Ser Ser Glu Glu Ile Arg Val Phe Pro Asn Pro 
770 775 780 

Ala Arg Asp Tyr Val Glu Ile Ser Ala Pro Cys Ile Pro Gln Glu Thr 
785 790 795 800 

Ser Ile Ile Leu Phe Asp Leu Ser Gly Lys Ile Val Met Lys Asn Ser 
805 810 815 

Leu Ser Ala Gly His Gly Arg Met Asp Val Ser Arg Leu Pro Asn Gly 
820 825 830 

Ala Tyr Ile Leu Lys Val Asp Gly Tyr Thr Thr Lys Ile Asn Ile Val 
835 840 845 

His 




(2) INFORMATION FOR SEQ ID NO: 13 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1269 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

Val Lys Cys Met Gly Lys Tyr Lys Arg Ala Lys Tyr Arg Tyr Trp Leu 
1 5 10 15 

Phe Pro Phe Cys Ser Asp Tyr Tyr Thr Phe Glu Gly Val Thr Phe Leu 
20 25 30 

Cys Ala Ser Asp Asp Met Thr Thr Lys Lys Pro Gln Ala Ile Leu Asp 
35 40 45 

Leu Glu Lys Ala Tyr Asn Ile Glu Ile Pro Asp Leu Ser Ser Gln Glu 
50 55 60 

Gly Ile Ser Trp Ser Val Asn Arg Tyr Phe Lys Gln Asp Ser Ser Gly 
65 70 75 80 

Ala Val Val Glu Leu Cys Leu Arg Glu Cys Gln Ile Glu Ser Met Thr 
85 90 95 

Trp Leu Ile Asp Phe Pro Ala Leu Lys Lys Leu Asp Leu Ser Tyr Asn 
100 105 110 

Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys 
115 120 125 

Leu Arg Leu Arg Ser Asn Gln Ile Arg Lys Leu Glu Gly Leu Asp Ser 
130 135 140 

Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gln Ile Ser Lys 
145 150 155 160 

Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu 
165 170 175 

Asp Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu 
180 185 190 

Ala Thr Leu Glu Leu Ser Gly Asn Gln Ile Arg Lys Leu Glu Gly Leu 
195 200 205 

Glu Arg Leu Thr Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn Gln Ile 
210 215 220 

Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys Leu Arg 
225 230 235 240 

Leu Arg Ser Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr 
245 250 255 

Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn Gln Ile Arg Lys Leu Glu 
260 265 270 

Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn 
275 280 285 

Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Ser Ser Leu Thr Lys 
290 295 300 

Leu Arg Leu Arg Ser Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg 
305 310 315 320 

Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gln Ile Ser Lys 
325 330 335 

Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu 
340 345 350 

Asp Asn Gln Ile Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu 
355 360 365 

Thr Lys Leu Arg Leu Arg Ser Asn Gln Ile Ser Lys Leu Glu Gly Leu 
370 375 380 

Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gln Ile 
385 390 395 400 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr 
405 410 415 

Leu Leu Asp Asn Gln Ile Arg Lys Leu Glu Gly Leu Asp Gly Leu Ala 
420 425 430 

Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gln Ile Ser Lys Leu Glu 
435 440 445 

Gly Leu Asp Arg Leu Lys Val Leu Arg Lys Leu Asp Val Ser Gly Asn 
450 455 460 

Asp Ile Gln Ser Ile Asp Asp Ile Lys Leu Leu Ala Pro Ile Leu Glu 
465 470 475 480 

Gln Thr Leu Glu Lys Leu Arg Ile His Asp Asn Pro Phe Val Ala Ser 
485 490 495 

Ser Gly Leu Ile Leu Ser Pro Tyr Asp Asn His Leu Pro Glu Ile Lys 
500 505 510 

Ala Leu Leu Glu Lys Glu Lys Glu Lys Gln Lys Lys Thr Ser Val Glu 
515 520 525 

Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Asn His Ser Ser Gly 
530 535 540 

Lys Thr Thr Phe Leu Ser Gln Tyr Asp Thr Asn Tyr Thr Tyr Gln Lys 
545 550 555 560 

Asn Thr His Val Leu Ser Ile His Arg Ser Asn Asn Pro Asn Ala Ile 
565 570 575 

Phe Tyr Asp Phe Gly Gly Gln Asp Tyr Tyr His Gly Ile Tyr Gln Ala 
580 585 590 

Phe Phe Thr Thr Gln Ser Leu Tyr Leu Leu Phe Trp Asp Ala Lys Lys 
595 600 605 

Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gln Thr Leu Asn 
610 615 620 

Phe Asn Arg Pro Tyr Trp Leu Gly Gln Ile Ala Tyr Ala Cys Asn Arg 
625 630 635 640 

Cys Met Ser Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro Gln Thr 
645 650 655 

Thr Asp Asp Thr Ile Ile Ile Gln Thr His Ala Asp Glu Thr Gly Ala 
660 665 670 

Lys Gln Gln Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu Glu Glu 
675 680 685 

Ile Tyr Val Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His Ala Leu 
690 695 700 

Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arg Ser Lys 
705 710 715 720 

Ser Ile Gln Ile Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro 
725 730 735 

Thr Ile Ala Gly Asp Asn Lys His Ile Pro Ile Ser Leu Glu Ala Leu 
740 745 750 

Ala Ala Gln Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr Ile 
755 760 765 

Glu Tyr Leu Gln Thr Glu Leu Asn Gln Leu Ser Leu Arg Gly Glu Val 
770 775 780 

Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp Leu Asp 
785 790 795 800 

Pro Ala Ala Phe Val Gln Met Ile His Gly Glu Ile Leu Gln Lys Asp 
805 810 815 

Asn Ile Asn Arg Gly Thr Val Pro Lys Asp Ile Phe Glu Cys Lys Leu 
820 825 830 

His Asn Leu Ser Ser Gly Ser Ile Phe Glu Glu Asp Gly Gln Asn Gly 
835 840 845 

Asn Met Ile Leu Gln Leu Leu Leu Glu Glu Leu Ile Val Tyr Glu Asp 
850 855 860 

Lys Asp Cys Tyr Val Ile Pro Gly Tyr Leu Pro Leu His Ser Asp Asp 
865 870 875 880 

Glu Ala Tyr Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asn Phe Val 
885 890 895 

Leu Lys Phe Glu Arg Phe Ile Pro Phe Gly Leu Ile Asn Gln Ile Ile 
900 905 910 

Ala Tyr Tyr Gly Arg Glu Glu Gly Ala Leu Lys Arg Tyr Trp Arg Asp 
915 920 925 

Gln Val Ile Phe Thr Ala Gly Arg Glu Met Asp Arg Gln Thr Leu Glu 
930 935 940 

Gln Glu Glu Glu Lys Glu Gly Leu Pro Lys Thr Asn Ala Glu Asp Tyr 
945 950 955 960 

Gln Ile Trp Ile Lys Leu Asp Phe Thr Asp Leu Ala Ile Ser Val Phe 
965 970 975 

Ile Lys Glu Gln Arg Lys Thr Ser Ala Lys Asp Met Gln Arg Lys Glu 
980 985 990 

Ala Thr Ile Leu Ser Asp Met Leu Asp Met Tyr Trp Asn Asn Ile Pro 
995 1000 1005 

Pro Arg Glu Gln Ile Gly Asp Lys Asp Thr Glu Gln Thr Arg Ser Thr 
1010 1015 1020 

Ile Arg Glu Thr Asn Arg Lys Lys Arg Pro Ile Gln Asp Leu Tyr Leu 
1025 1030 1035 1040 

Ser Cys Ala Gln Ala Asp Lys Asp Leu Thr Glu Ser His Tyr Ile His 
1045 1050 1055 

Leu Gly Thr Leu Asp Asp Glu Ser Lys Thr Thr Ala Arg Ile Ala Ala 
1060 1065 1070 

Tyr Pro Leu Lys Asn Gly Val Ile Asp Lys Glu Arg Val Arg Glu Val 
1075 1080 1085 

Ser Thr Arg Pro Tyr Lys His Leu Ser Val Asn Lys Asn Leu Ala Thr 
1090 1095 1100 

Ala Lys Gln Ile Phe Ile Ser Tyr Ser Lys Glu Asp Gln Thr Glu Leu 
1105 1110 1115 1120 

Glu Thr Cys Leu Gln Phe Phe Lys Pro Leu Glu Lys Asn Gly Gln Ile 
1125 1130 1135 

Glu Ile Tyr Tyr Asp Lys Leu Thr Lys Phe Glu Thr Pro Ile His Pro 
1140 1145 1150 

Glu Ile Arg Lys Arg Ile Val Glu Ala Asp Cys Ile Ile Ala Leu Ile 
1155 1160 1165 

Ser Gln Arg Tyr Leu Ala Thr Asp Tyr Ile Leu Asp His Glu Leu Pro 
1170 1175 1180 

Val Phe Arg Glu Tyr Asn Lys Thr Ile Val Pro Ile Leu Ile Lys Pro 
1185 1190 1195 1200 

Cys Thr Phe Glu Asp Asp Glu Phe Leu Arg Glu Lys Tyr Phe Ala Gln 
1205 1210 1215 

Lys Ala Gln Ile Ile Asn Leu Gly Lys Glu Gly Lys Thr Ile Lys Ala 
1220 1225 1230 

Tyr Asp Ser Ile Thr Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala 
1235 1240 1245 

Val Val Arg Glu Phe Lys Glu Lys Ile Leu Arg Ile Thr Lys Gln Glu 
1250 1255 1260 

Val Asn Thr Asp Glu 
1265 



(2) INFORMATION FOR SEQ ID NO: 14 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

Leu Phe Asn His Lys Lys Ser Trp Tyr Glu Thr Phe Gln Phe Tyr Leu
1 5 10 15

Val Val Phe Arg Ser Gly Phe Ile Leu Leu Cys Glu Asn Thr Leu Ala
20 25 30

Gln Gln Lys Thr Glu Glu Phe Ala Pro Val Ser Asp Leu Arg Ala Glu
35 40 45

Ala Tyr Gly Ser Thr Val Phe Leu His Trp Thr Pro Pro Tyr Asp Asn
50 55 60

Pro Met Ile Pro Leu Ser Glu Ser Phe Glu Ser Gly Ile Pro Ala Ile
65 70 75 80

Trp Lys Thr Ile Asp Ala Asp Gly Asp Gly Tyr Asn Trp Met His Leu
85 90 95

Thr Asn Phe Thr Gly Gln Ser Gly Leu Cys Val Ser Ser Ala Ser Tyr
100 105 110

Ile Gly Gly Val Gly Ala Leu Thr Pro Asp Asn Tyr Leu Ile Thr Pro
115 120 125

Glu Leu Lys Leu Pro Thr Asp Ala Leu Val Glu Ile Ile Tyr Trp Val
130 135 140

Cys Thr Gln Asp Leu Thr Ala Pro Ser Glu His Tyr Ala Val Tyr Ser
145 150 155 160

Ser Ser Thr Gly Asn Asn Ala Ala Asp Phe Val Asn Leu Leu Tyr Glu
165 170 175

Glu Thr Leu Thr Ala Lys Arg Ile Gln Ser Pro Glu Leu Ile Arg Gly
180 185 190

Asn Arg Thr Gln Gly Val Trp Tyr Gln Arg Lys Val Val Leu Pro Asn
195 200 205

Asp Thr Lys Tyr Val Ala Phe Arg His Phe Asn Ser Thr Asp Asn Phe
210 215 220

Trp Leu Asn Leu Asp Glu Val Ser Ile Leu Tyr Thr Pro Leu Pro Arg
225 230 235 240

Arg Ala Pro Cys Pro His Pro Gly Gly Tyr Thr Tyr Ser Val Phe Arg
245 250 255

Asp Gly Gln Lys Ile Ala Ser Gly Leu Ser Ala Leu Ala Tyr Ile Asp
260 265 270

Thr Asp Val Pro Tyr Gly Thr Gln Asp Tyr Cys Val Gln Val Asn Tyr
275 280 285

Leu Gln Gly Asp Ser Tyr Lys Val Cys Lys Asn Ile Val Val Ala Asn
290 295 300

Ser Ala Asn Ile Tyr Gly Ala Asp Lys Pro Phe Ala Leu Thr Val Val
305 310 315 320

Gly Lys Thr Ile Val Ala Ser Ala Phe Lys Gly Glu Ile Thr Leu Tyr
325 330 335

Asp Ile Arg Gly Arg Leu Ile Ala Ser Gly Cys Asp Thr Leu Arg Tyr
340 345 350

Lys Ala Glu Asn Gly Phe Tyr Leu Ile Lys Ile Gln Val Asn Gly Thr
355 360 365

Val Tyr Thr Glu Lys Ile Gln Ile Gln
370 375

(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 763 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...763

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15

His Phe Leu Val Thr Ile Met Arg Thr Lys Thr Ile Phe Phe Ala Ile
1 5 10 15

Ile Ser Phe Ile Ala Leu Leu Ser Ser Ser Leu Ser Ala Gln Ser Lys
20 25 30

Ala Val Leu Thr Gly Ser Val Ser Asp Ala Glu Thr Gly Glu Pro Leu
35 40 45

Ala Gly Ala Arg Ile Glu Val Lys His Thr Asn Ile Val Ala Gly Ala
50 55 60

Asp Ala Gly Gly His Phe Glu Ile Lys Asn Leu Pro Ala Gly Gln His
65 70 75 80

Thr Ile Ile Cys Ser Leu Gly Gly Tyr Gly Gln Lys Glu Glu Val Val
85 90 95

Ala Ile Glu Ala Gly Gln Thr Lys Thr Ile Ser Phe Ala Leu Arg Leu
100 105 110

Arg Thr Asn Asn Leu Glu Glu Val Val Val Thr Gly Thr Gly Thr Arg
115 120 125

Tyr Arg Leu Val Asp Ala Pro Val Ala Thr Glu Val Leu Thr Ala Lys
130 135 140

Asp Ile Ala Ser Phe Ser Ala Pro Thr Ser Glu Ala Leu Leu Gln Gly
145 150 155 160

Leu Ser Pro Ser Phe Asp Phe Gly Pro Asn Leu Met Gly Ser Phe Met
165 170 175

Gln Leu Asn Gly Leu Ser Ser Lys Tyr Ile Leu Ile Leu Ile Asp Gly
180 185 190

Lys Arg Val Tyr Gly Asp Val Gly Gly Gln Ala Asp Leu Ser Arg Ile
195 200 205

Ser Pro Asp Gln Ile Glu Arg Ile Glu Leu Val Lys Gly Ala Ser Ser
210 215 220

Ser Leu Tyr Gly Ser Asp Ala Ile Ala Gly Val Ile Asn Val Ile Thr
225 230 235 240

Lys Lys Asn Thr Asn Arg Leu Ser Ala Tyr Thr Ser His Arg Ile Ser
245 250 255

Lys Tyr Asn Asp Arg Gln Thr Asn Thr Ser Leu Asp Ile Asn Ile Gly
260 265 270

Lys Phe Ser Ser Asn Thr Asn Tyr Phe Phe Tyr His Thr Asp Gly Trp
275 280 285

Gln Asn Ser Pro Phe Glu Ile Lys Lys Lys Lys Gly Ser Gly Glu Pro
290 295 300

Val Leu Glu Glu Thr Tyr Lys Lys Thr Phe Arg Ala Gln Glu Asn Gln
305 310 315 320

Gly Val Ser Gln Ser Leu Ser Tyr Tyr Ala Thr Asn Asn Leu Ser Phe
325 330 335

Ser Gly Asn Val Gln Tyr Asn Lys Arg Gln Ile Phe Thr Pro Thr Phe
340 345 350

Ser Glu Lys Lys Ala Tyr Asp Met Asp Tyr Arg Ala Leu Thr Ala Ser
355 360 365

Leu Gly Thr Asn Tyr Leu Phe Pro Asn Gly Leu His Thr Leu Ser Phe
370 375 380

Asp Ala Val Tyr Asp Arg Phe Arg Phe Gly Tyr Leu Tyr His Asp Lys
385 390 395 400

Asp Ser Ser Glu Ser Leu Ile Asn Asn Gln Gly Gln Thr Glu Gln Pro
405 410 415

Thr Phe Phe Pro Gly Gln Leu Arg Asn Lys Asn Asp Gln Ile Arg Tyr
420 425 430

Thr Ala Glu Ala Arg Gly Val Phe Thr Leu Pro Tyr Ala Gln Lys Leu
435 440 445

Thr Gly Gly Leu Glu Tyr Phe Arg Glu Glu Leu Ile Ser Pro Tyr Asn
450 455 460

Leu Ile Thr Asp Lys Ala Asp Ala Ser Thr Leu Ser Ala Tyr Val Gln
465 470 475 480

Asp Glu Trp Lys Pro Leu Asp Trp Phe Asn Met Thr Ala Gly Phe Arg
485 490 495

Leu Val His His Gln Glu Phe Gly Thr Arg Met Thr Pro Lys Val Ser
500 505 510

Ile Leu Ala Lys Tyr Gly Pro Leu Asn Phe Arg Ala Thr Tyr Ala Asn
515 520 525

Gly Tyr Lys Thr Pro Thr Leu Lys Glu Leu Phe Ala Arg Asn Glu Leu
530 535 540

Thr Thr Met Gly Ser His Asn Leu Tyr Leu Gly Asn Ala Asp Leu Lys
545 550 555 560

Pro Gln Met Ser Asp Tyr Tyr Ala Leu Gly Leu Glu Tyr Asn Gln Gly
565 570 575

Pro Ile Ser Phe Ser Ala Thr Val Tyr Asp Asn Glu Leu Arg Asn Leu
580 585 590

Ile Ser Phe Met Asp Ile Pro Thr Ser Pro Glu His Glu Ala Gln Gly
595 600 605

Ile Lys Lys Thr Lys Gln Tyr Ala Asn Ile Gly Lys Ala Arg Ser Arg
610 615 620

Gly Leu Asp Val Leu Cys Asp Ala Ser Ile Gly Trp Gly Ile Lys Leu
625 630 635 640

Gly Ala Gly Tyr Ser Leu Val Glu Ala Lys Asn Leu Gln Thr Asp Glu
645 650 655

Trp Leu Glu Gly Ala Ala Arg His Arg Ala Asn Val His Ala Asp Trp
660 665 670

Val His Tyr Trp Gly Gln Tyr Arg Leu Gly Val Ser Leu Phe Gly Arg
675 680 685

Ile Gln Ser Glu Arg Tyr Tyr Lys Asp Gly Asn Ala Pro Asp Tyr Thr
690 695 700

Leu Trp Arg Leu Ala Thr Ser His Arg Phe Ala His Phe Arg His Ile
705 710 715 720

Ile Leu Asp Gly Thr Leu Gly Ile Asp Asn Leu Phe Asp Tyr Val Asp
725 730 735

Asp Arg Pro Met Gly Val Asn Tyr Ala Thr Val Thr Pro Gly Arg Thr
740 745 750

Phe Phe Ala Gln Ile Ala Ile Arg Phe Asn Asn
755 760

(2) INFORMATION FOR SEQ ID NO: 16 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...582

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

Ser Lys Ile Val Leu Arg Lys Phe Cys Thr Leu Ala Arg Met Lys Lys
1 5 10 15

Thr Asn Leu Phe Leu Ser Leu Leu Val Ile Phe Ile Thr Gly Ser Phe
20 25 30

Met Thr Ala Cys Ala Gln Lys Ser Lys Thr Asn Lys Leu Thr Glu Glu
35 40 45

Asp Arg Ser Arg Asn Glu Tyr Val Gln Ser Met Asp Val Leu Ser Asn
50 55 60

Ile Ile Gly Asn Val Arg Leu Tyr Phe Val Asp Thr Ile Ser Ile Lys
65 70 75 80

His Met Thr Arg Arg Gly Ile Asp Ala Met Leu Gly Gly Leu Asp Pro
85 90 95

Tyr Thr Glu Tyr Ile Pro Tyr Glu Glu Met Asp Glu Leu Lys Leu Met
100 105 110

Thr Thr Gly Glu Tyr Ala Gly Val Gly Ala Ile Ile Ser Gln Arg Pro
115 120 125

Asp Ser Ala Val Ile Ile Gln Arg Pro Met Glu Gly Met Pro Ala Asp
130 135 140

Glu Ala Gly Leu Ile Ala Gly Asp Arg Ile Leu Thr Ile Asp Gly Lys
145 150 155 160

Asp Phe Arg Lys Ser Thr Thr Pro Lys Val Ser Gln Ala Leu Lys Gly
165 170 175

Ile Ala Gly Thr Val Ala Lys Val Thr Val Met Arg Tyr Gly Glu Thr
180 185 190

Lys Pro Arg Thr Phe Ser Val Lys Arg Gln Lys Val Ile Met Asn Ser
195 200 205

Val Thr Tyr Ser Gly Met Leu Asp Gly Ser Ile Gly Tyr Ile Arg Leu
210 215 220

Asn Asn Phe Thr Asp Lys Ser Ala Glu Glu Val Arg Thr Ala Leu Leu
225 230 235 240

Asp Leu Arg Asp Lys Gln Gly Ala Lys Gly Leu Ile Leu Asp Leu Arg
245 250 255

Gly Asn Gly Gly Gly Leu Met Gln Ala Ala Ile Glu Ile Val Asn Leu
260 265 270

Phe Val Pro Lys Gly Lys Glu Val Val Thr Thr Lys Gly Arg Ile Ala
275 280 285

Glu Ser Ala Ser Val Phe Arg Thr Leu Thr Glu Pro Ile Asp Thr Lys
290 295 300

Leu Pro Ile Val Val Leu Ile Asp Gly Gln Ser Ala Ser Ser Ser Glu
305 310 315 320

Ile Val Ala Gly Ala Leu Gln Asp Met Asp Arg Ala Val Leu Met Gly
325 330 335

Gln Lys Ser Tyr Gly Lys Gly Leu Val Gln Thr Thr Arg Gln Leu Pro
340 345 350

Tyr Asn Gly Val Ile Lys Leu Thr Thr Ala Lys Tyr Tyr Ile Pro Ser
355 360 365

Gly Arg Cys Ile Gln Arg Leu Asp Tyr Ser Arg Thr Asn Arg Thr Gly
370 375 380

Met Ala Thr Ala Ile Pro Asp Ser Leu His Lys Ile Phe Tyr Thr Ala
385 390 395 400

Ala Gly Arg Arg Val Glu Asp Ala Gly Gly Ile Leu Pro Asp Ile Glu
405 410 415

Val Lys Gln Asp Thr Ala Ala Thr Leu Leu Tyr Tyr Met Ala Ile Asn
420 425 430

Asn Asp Val Phe Asp Phe Val Thr Gly Tyr Val Leu Lys His Lys Thr
435 440 445

Ile Ala Lys Pro Glu Asp Phe Ser Ile Thr Asn Glu Asp Tyr Ala Ala
450 455 460

Phe Cys Lys Met Met Glu Glu Lys Lys Phe Asp Tyr Asp Arg Gln Ser
465 470 475 480

Gly Lys Met Leu Asp Lys Leu Glu Glu Leu Ala Lys Ile Glu Gly Tyr
485 490 495

Leu Pro Glu Ala Asn Ser Glu Leu Lys Ala Leu Arg Glu Lys Leu Lys
500 505 510

Pro Asn Leu Ser Arg Asp Leu Leu Arg Phe Lys Lys Glu Ile Thr Asn
515 520 525

Tyr Leu Asn Asn Glu Ile Val Thr Arg Tyr Tyr Tyr Glu Arg Gly Ser
530 535 540

Ile Arg Gln Ser Leu Pro Glu Asp Lys Val Val Lys Glu Ala Ile Lys
545 550 555 560

Leu Leu Lys Asp His Pro Glu Gln Ile Arg Gln Ile Leu Ala Ala Pro
565 570 575

Lys Ala Glu Asn Lys Gly
580

(2) INFORMATION FOR SEQ ID NO: 17 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...295

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

Thr Lys Lys Thr Leu Met Lys Lys Leu Phe Leu Ser Leu Thr Ser Leu
1 5 10 15

Val Met Val Phe Ala Val Ala Ser Cys Asp Ile Ile Asp Lys Asp Gln
20 25 30

Thr Leu Leu Pro Ala Pro Thr Asn Val Thr Pro Asp Asn Pro Asp Asp
35 40 45

Asn Pro Ser Glu Ile Asp Ile Thr Gln Thr His Thr Glu Lys Tyr Val
50 55 60

Leu Ala Glu Glu Phe Thr Gly Gln Lys Cys Leu Asn Cys Pro Lys Gly
65 70 75 80

His Arg Lys Leu Ala Ala Leu Lys Glu Gln Tyr Gly Lys Arg Leu Thr
85 90 95

Val Val Gly Ile His Ala Gly Pro Gly Ser Leu Val Pro Pro Leu Phe
100 105 110

Arg Thr Glu Ala Gly Asp Ala Tyr Tyr Ser Lys Phe Ala Asn Asn Thr
115 120 125

Pro Leu Pro Ala Leu Met Val Ser Arg Lys Lys Phe Gly Ser Ser Tyr
130 135 140

Val Tyr Asp Lys Ser Tyr Lys Thr Trp Asp Val Pro Ile Ala Glu Gln
145 150 155 160

Met Glu Gln Lys Ala Lys Ile Asn Ile Phe Ala Val Ala Glu Tyr Thr
165 170 175

Asp Thr Gln Lys Ile Lys Val Thr Val Lys Gly Lys Ile Leu Glu Gly
180 185 190

Asn Thr Leu Pro Lys Ser Met Val Gln Val Tyr Leu Leu Glu Asp Lys
195 200 205

Leu Ile Ala Pro Gln Val Asp Gly Asn Thr Thr Val Glu Asn Tyr Glu
210 215 220

His Asn His Val Leu Arg Gly Ala Val Asn Gly Ile Trp Gly Glu Glu
225 230 235 240

Phe Val Asn Leu Lys Asp Tyr Leu Tyr Thr Tyr Ala Val Glu Pro Leu
245 250 255

Ser Gly Met Ser Phe Val Ala Glu Asn Tyr Ser Ile Val Ala Phe Val
260 265 270

Tyr Asp Val Gln Thr Phe Glu Val Tyr Asp Val Val His Val Lys Ile
275 280 285

Asn Pro Gln Ser Asp Gly Lys
290 295



(2) INFORMATION FOR SEQ ID NO: 18 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1046 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1046

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 

Asp Phe Pro Trp Val Arg Val Lys Pro Glu Lys Lys Arg Lys Gln His
1 5 10 15

Asn Ser Asn Asn Phe Lys Phe Asn Ala Lys Glu Lys Ser Met Lys Arg
20 25 30

Met Thr Leu Phe Phe Leu Cys Leu Leu Thr Ser Ile Gly Trp Ala Met
35 40 45

Ala Gln Asn Arg Thr Val Lys Gly Thr Val Ile Ser Ser Glu Asp Asn
50 55 60

Glu Pro Leu Ile Gly Ala Asn Val Val Val Val Gly Asn Thr Thr Ile
65 70 75 80

Gly Ala Ala Thr Asp Leu Asp Gly Asn Phe Thr Leu Ser Val Pro Ala
85 90 95

Asn Ala Lys Met Leu Arg Val Ser Tyr Ser Gly Met Thr Thr Lys Glu
100 105 110

Val Ala Ile Ala Asn Val Met Lys Ile Val Leu Asp Pro Asp Ser Lys
115 120 125

Val Leu Glu Gln Val Val Val Leu Gly Tyr Gly Thr Gly Gln Lys Leu
130 135 140

Ser Thr Val Ser Gly Ser Val Ala Lys Val Ser Ser Glu Lys Leu Ala
145 150 155 160

Glu Lys Pro Val Ala Asn Ile Met Asp Ala Leu Gln Gly Gln Val Ala
165 170 175

Gly Met Gln Val Met Thr Thr Ser Gly Asp Pro Thr Ala Val Ala Ser
180 185 190

Val Glu Ile His Gly Thr Gly Ser Leu Gly Ala Ser Ser Ala Pro Leu
195 200 205

Tyr Ile Val Asp Gly Met Gln Thr Ser Leu Asp Val Val Ala Thr Met
210 215 220

Asn Pro Asn Asp Phe Glu Ser Met Ser Val Leu Lys Asp Ala Ser Ala
225 230 235 240

Thr Ser Ile Tyr Gly Ala Arg Ala Ala Asn Gly Val Val Phe Ile Gln
245 250 255

Thr Lys Lys Gly Lys Met Ser Glu Arg Gly Arg Ile Thr Phe Asn Ala
260 265 270

Ser Tyr Gly Ile Ser Gln Ile Leu Asn Thr Lys Pro Leu Asp Asn Met
275 280 285

Met Thr Gly Asp Glu Leu Leu Asp Phe Gln Val Lys Ala Gly Phe Trp
290 295 300

Gly Asn Asn Gln Thr Val Gln Lys Val Lys Asp Met Ile Leu Ala Gly
305 310 315 320

Ala Glu Asp Leu Tyr Gly Asn Tyr Asp Ser Leu Lys Asp Glu Tyr Gly
325 330 335

Lys Thr Leu Phe Pro Val Asp Phe Asn His Asp Ala Asp Trp Leu Lys
340 345 350

Ala Leu Phe Lys Thr Ala Pro Thr Ser Gln Gly Asp Ile Ser Phe Ser
355 360 365

Gly Gly Ser Gln Gly Thr Ser Tyr Tyr Ala Ser Ile Gly Tyr Phe Asp
370 375 380

Gln Glu Gly Met Ala Arg Glu Pro Ala Asn Phe Lys Arg Tyr Ser Gly
385 390 395 400

Arg Leu Asn Phe Glu Ser Arg Ile Asn Glu Trp Leu Lys Val Gly Ala
405 410 415

Asn Leu Ser Gly Ala Ile Ala Asn Arg Arg Ser Ala Asp Tyr Phe Gly
420 425 430

Lys Tyr Tyr Met Gly Ser Gly Thr Phe Gly Val Leu Thr Met Pro Arg
435 440 445

Tyr Tyr Asn Pro Phe Asp Val Asn Gly Asp Leu Ala Asp Val Tyr Tyr
450 455 460

Met Tyr Gly Ala Thr Arg Pro Ser Met Thr Glu Pro Tyr Phe Ala Lys
465 470 475 480

Met Arg Pro Phe Ser Ser Glu Ser His Gln Ala Asn Val Asn Gly Phe
485 490 495

Ala Gln Ile Thr Pro Ile Lys Gly Leu Thr Leu Lys Ala Gln Ala Gly
500 505 510

Val Asp Ile Thr Asn Thr Arg Thr Ser Ser Lys Arg Met Pro Asn Asn
515 520 525

Pro Tyr Asp Ser Thr Pro Leu Gly Glu Arg Arg Glu Arg Ala Tyr Arg
530 535 540

Asp Val Ser Lys Ser Phe Thr Asn Thr Ala Glu Tyr Lys Phe Ser Ile
545 550 555 560

Asp Glu Lys His Asp Leu Thr Ala Leu Met Gly His Glu Tyr Ile Glu
565 570 575

Tyr Glu Gly Asp Val Ile Gly Ala Ser Ser Lys Gly Phe Glu Ser Asp
580 585 590

Lys Leu Met Leu Leu Ser Gln Gly Lys Thr Gly Asn Ser Leu Ser Leu
595 600 605

Pro Glu His Arg Val Ala Glu Tyr Ala Tyr Leu Ser Phe Phe Ser Arg
610 615 620

Phe Asn Tyr Gly Phe Asp Lys Trp Met Tyr Ile Asp Phe Ser Val Arg
625 630 635 640

Asn Asp Gln Ser Ser Arg Phe Gly Ser Asn Asn Arg Ser Ala Trp Phe
645 650 655

Tyr Ser Val Gly Gly Met Phe Asp Ile Tyr Asn Lys Phe Ile Gln Glu
660 665 670

Ser Asn Trp Leu Ser Asp Leu Arg Leu Lys Met Ser Tyr Gly Thr Thr
675 680 685

Gly Asn Ser Glu Ile Gly Asn Tyr Asn His Gln Ala Leu Val Thr Val
690 695 700

Asn Asn Tyr Thr Glu Asp Ala Met Gly Leu Ser Ile Ser Thr Ala Gly
705 710 715 720

Asn Pro Asp Leu Ser Trp Glu Lys Gln Ser Gln Phe Asn Phe Gly Leu
725 730 735

Ala Ala Gly Ala Phe Asn Asn Arg Leu Ser Ala Glu Val Asp Phe Tyr
740 745 750

Val Arg Thr Thr Asn Asp Met Leu Ile Asp Val Pro Met Pro Tyr Ile
755 760 765

Ser Gly Phe Phe Ser Gln Tyr Gln Asn Val Gly Ser Met Lys Asn Thr
770 775 780

Gly Val Asp Leu Ser Leu Lys Gly Thr Ile Tyr Gln Asn Lys Asp Trp
785 790 795 800

Asn Val Tyr Ala Ser Ala Asn Phe Asn Tyr Asn Arg Gln Glu Ile Thr
805 810 815

Lys Leu Phe Phe Gly Leu Asn Lys Tyr Met Leu Pro Asn Thr Gly Thr
820 825 830

Ile Trp Glu Ile Gly Tyr Pro Asn Ser Phe Tyr Met Ala Glu Tyr Ala
835 840 845

Gly Ile Asp Lys Lys Thr Gly Lys Gln Leu Trp Tyr Val Pro Gly Gln
850 855 860

Val Asp Ala Asp Gly Asn Lys Val Thr Thr Ser Gln Tyr Ser Ala Asp
865 870 875 880

Leu Glu Thr Arg Ile Asp Lys Ser Val Thr Pro Pro Ile Thr Gly Gly
885 890 895

Phe Ser Leu Gly Ala Ser Trp Lys Gly Leu Ser Leu Asp Ala Asp Phe
900 905 910

Ala Tyr Ile Val Gly Lys Trp Met Ile Asn Asn Asp Arg Tyr Phe Thr
915 920 925

Glu Asn Unk Gly Gly Leu Met Gln Leu Asn Lys Asp Lys Met Leu Leu
930 935 940

Asn Ala Trp Thr Glu Asp Asn Lys Glu Thr Asp Val Pro Lys Leu Gly
945 950 955 960

Gln Ser Pro Gln Phe Asp Thr His Leu Leu Glu Asn Ala Ser Phe Leu
965 970 975

Arg Leu Lys Asn Leu Lys Leu Thr Tyr Val Leu Pro Asn Ser Leu Phe
980 985 990

Ala Gly Gln Asn Val Ile Gly Gly Ala Arg Val Tyr Leu Met Ala Arg
995 1000 1005

Asn Leu Leu Thr Val Thr Lys Tyr Lys Gly Phe Asp Pro Glu Ala Gly
1010 1015 1020

Gly Asn Val Gly Lys Asn Gln Tyr Pro Asn Ser Lys Gln Tyr Val Ala
1025 1030 1035 1040

Gly Ile Gln Leu Ser Phe
1045



(2) INFORMATION FOR SEQ ID NO: 19 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...861

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 



Leu Glu Met Ala Tyr Asp Phe Thr Gln Thr Phe Arg Asn Ser Leu Glu
1 5 10 15

Tyr Ser Tyr Gln Glu Ala Thr Arg Leu Gly Val Val Ala Val Thr Gln
20 25 30

Asp Met Leu Val Leu Gly Ile Ile Arg Asp Gly Asp Asn Gly Ala Ile
35 40 45

Asp Ile Met Arg His Tyr Gly Ile Asn Leu Tyr Glu Leu Lys Arg Leu
50 55 60

Ile Glu Leu Glu Ala Ile Ala Glu Ser Leu Pro Ala Ser Pro Glu Gly
65 70 75 80

Ser Pro Ile Phe Thr Pro Ser Ala Arg Glu Ala Ile Asp Asp Ala Thr
85 90 95

Asp Ile Cys Ala Asp Met Glu Asp Glu Ala Val Ser Pro Val His Leu
100 105 110

Leu Leu Ser Ile Leu Asn Ser Thr Gln Glu Ser Leu Val Gln Lys Ile
115 120 125

Phe Met Lys Gln Gly Ile Lys Tyr Asp Thr Ile Leu Ser Asp Tyr Phe
130 135 140

Gly Gln Arg Asn Pro Ser Glu Gly Lys Ser Pro Ser Glu Met Glu Ile
145 150 155 160

Leu Asp Gly Tyr Gln Asp Asn Asp Phe Asp Asp Glu Glu Asp Glu Ser
165 170 175

Ser Pro Pro Ser Gly Asn Ser Gly Thr Gly Gly Gly Ser Gly Asp Ala
180 185 190

Pro Glu Gln Asn Thr Gly Gly Gly Asp Thr Thr Thr Thr Thr Arg Ser
195 200 205

Gly Gly Asp Thr Pro Ala Leu Asp Thr Phe Gly Thr Asp Ile Thr Ala
210 215 220

Met Ala Ala Ala Gly Lys Leu Asp Pro Val Val Gly Arg Glu Gln Glu
225 230 235 240

Ile Glu Arg Val Ile Gln Ile Leu Ser Arg Arg Lys Lys Asn Asn Pro
245 250 255

Val Leu Ile Gly Glu Pro Gly Val Gly Lys Ser Ala Ile Val Glu Gly
260 265 270

Leu Ala Glu Arg Ile Val Asn Arg Lys Val Ser Arg Ile Leu Phe Asp
275 280 285

Lys Arg Ile Ile Ser Leu Asp Leu Ala Gln Met Val Ala Gly Thr Lys
290 295 300

Tyr Arg Gly Gln Phe Glu Glu Arg Leu Lys Ala Val Leu Asp Glu Leu
305 310 315 320

Lys Lys Asn Pro Gln Ile Ile Leu Phe Ile Asp Glu Ile His Thr Ile
325 330 335

Val Gly Ala Gly Ser Ala Ala Gly Ser Met Asp Thr Ala Asn Met Leu
340 345 350

Lys Pro Ala Leu Ala Arg Gly Gln Val Gln Cys Ile Gly Ala Thr Thr
355 360 365

Leu Asp Glu Tyr Arg Lys Asn Ile Glu Lys Asp Gly Ala Leu Glu Arg
370 375 380

Arg Phe Gln Lys Val Pro Ile Ala Pro Ser Thr Ala Glu Glu Thr Leu
385 390 395 400

Thr Ile Leu Gln Asn Ile Lys Glu Lys Tyr Glu Asp Tyr His Gly Val
405 410 415

Arg Tyr Thr Asp Glu Ala Ile Lys Ala Ala Val Glu Leu Thr Asp Arg
420 425 430

Tyr Val Ser Asp Arg Phe Phe Pro Asp Lys Ala Ile Asp Ala Met Asp
435 440 445

Glu Ala Gly Ala Ser Val His Ile Thr Asn Val Val Ala Pro Lys Glu
450 455 460

Ile Glu Ile Leu Glu Ala Glu Leu Ala Ser Val Arg Glu Asn Lys Leu
465 470 475 480

Ser Ala Val Lys Ala Gln Asn Tyr Glu Leu Ala Ala Ser Phe Arg Asp
485 490 495

Gln Glu Arg Arg Thr Gln Gln Gln Ile Ala Glu Glu Lys Lys Lys Trp
500 505 510

Glu Glu Gln Met Ser Lys His Arg Glu Thr Val Asp Glu Asn Val Val
515 520 525

Ala His Val Val Ala Leu Met Thr Gly Val Pro Ala Glu Arg Leu Ser
530 535 540

Thr Gly Glu Gly Glu Arg Leu Arg Thr Met Ala Asp Asp Leu Lys Thr
545 550 555 560

Lys Val Val Gly Gln Asp Thr Ala Ile Glu Lys Met Val His Ala Ile
565 570 575

Gln Arg Asn Arg Leu Gly Leu Arg Asn Glu Lys Lys Pro Ile Gly Ser
580 585 590

Phe Leu Phe Leu Gly Pro Thr Gly Val Gly Lys Thr Tyr Leu Ala Lys
595 600 605

Lys Leu Ala Glu Tyr Leu Phe Glu Asp Glu Asn Ala Met Ile Arg Val
610 615 620

Asp Met Ser Glu Tyr Met Glu Lys Phe Ser Val Ser Arg Leu Val Gly
625 630 635 640

Ala Pro Pro Gly Tyr Val Gly Tyr Glu Glu Gly Gly Gln Leu Thr Glu
645 650 655

Arg Val Arg Arg Lys Pro Tyr Ser Val Val Leu Leu Asp Glu Ile Glu
660 665 670

Lys Ala His Ala Asp Val Phe Asn Leu Leu Leu Gln Val Met Asp Glu
675 680 685

Gly Gln Leu Thr Asp Ser Leu Gly Arg Arg Val Asn Phe Lys Asn Thr
690 695 700

Val Ile Ile Ile Thr Ser Asn Val Gly Thr Arg Gln Leu Lys Asp Phe
705 710 715 720

Gly Gln Gly Ile Gly Phe Arg Ser Glu Lys Asp Glu Glu Ala Asn Lys
725 730 735

Glu His Ser Arg Ser Val Ile Gln Lys Ala Leu Asn Lys Thr Phe Ser
740 745 750

Pro Glu Phe Leu Asn Arg Leu Asp Asp Ile Ile Leu Phe Asp Gln Leu
755 760 765

Gly Lys Thr Glu Ile Arg Arg Met Val Asp Ile Glu Leu Lys Ala Val
770 775 780

Leu Ala Arg Ile His Arg Ala Gly Tyr Asp Leu Val Leu Thr Asp Glu
785 790 795 800

Ala Lys Asp Val Ile Ala Thr Lys Gly Tyr Asp Leu Gln Tyr Gly Ala
805 810 815

Arg Pro Leu Lys Arg Thr Leu Gln Asn Glu Val Glu Asp Arg Leu Thr
820 825 830

Asp Leu Ile Leu Ser Gly Gln Ile Glu Lys Gly Gln Thr Leu Thr Leu
835 840 845

Ser Ala Arg Asp Gly Glu Ile Ile Val Gln Glu Gln Ala
850 855 860

(2) INFORMATION FOR SEQ ID NO: 20

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 
(A) NAME/KEY: misc_feature

(B) LOCATION 1...299

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Leu Lys Lys Glu Ile Thr Met Lys Gln Asn Tyr Phe Lys Arg Val Cys
1 5 10 15

Ser Leu Leu Trp Leu Val Leu Pro Met Leu Ile Met Pro Leu Glu Val
20 25 30

Ala Ala Gln Glu Ile Ile Pro Asn Glu Glu Val Leu Glu Ser Leu Thr
35 40 45

Phe Val Ala Pro Val Glu Glu Thr Asp Ala Ile Glu Ala Glu Val Glu
50 55 60

Ala Leu Gln Glu Ile Val Ala Thr Glu Glu Ile Ala Glu Gln Ala Val
65 70 75 80

Arg Ser Tyr Thr Tyr Thr Val Tyr Arg Asp Gly Val Lys Ile Ala Ser
85 90 95

Gly Leu Thr Glu Pro Thr Phe Leu Asp Glu Asp Val Pro Ala Gly Glu
100 105 110

His Thr Tyr Cys Val Glu Val Gln Tyr Gln Gly Gly Val Ser Asp Lys
115 120 125

Val Cys Val Asp Val Glu Val Lys Asp Phe Lys Pro Val Thr Asn Leu
130 135 140

Thr Gly Thr Ala Ser Asn Asp Glu Val Ser Leu Asp Trp Asp Gly Val
145 150 155 160

Glu Glu Lys Ala Glu Glu Pro Ala Ser Asp Lys Ala Val Ser Tyr Asn
165 170 175

Val Tyr Lys Asn Gly Thr Leu Ile Gly Asn Thr Ala Glu Thr His Tyr
180 185 190

Val Glu Thr Gly Val Ala Asn Gly Thr Tyr Ile Tyr Glu Val Glu Val
195 200 205

Lys Tyr Pro Asp Gly Val Ser Pro Lys Val Ala Val Thr Val Thr Val
210 215 220

Thr Asn Ser Ser Leu Ser Asn Val Asp Gly Gln Ala Pro Tyr Thr Leu
225 230 235 240

Arg Val Glu Gly Lys Lys Ile Ile Ala Glu Ala His Gly Met Ile Thr
245 250 255

Leu Tyr Asp Ile Asn Gly Arg Thr Val Ala Val Ala Pro Asn Arg Leu
260 265 270

Glu Tyr Met Ala Gln Thr Gly Phe Tyr Ala Val Arg Phe Asp Val Gly
275 280 285

Asn Lys His His Val Ser Lys Ile Gln Val Arg
290 295



(2) INFORMATION FOR SEQ ID NO: 21 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...464

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 



Tyr Gly Lys Arg Arg Lys Leu Gly Thr Ser Val Arg Pro Ser Val Leu
1 5 10 15

Thr Gln Ile Arg Phe Ile Leu Asp Leu His Leu Ile Thr Asp Phe Phe
20 25 30

Glu Gly Leu Arg Val Asn Pro Ile Gly Ala Ala Ala Ile Val Ala Phe
35 40 45

Ile Ile Asp Leu Leu Leu Leu Cys Cys Ser Ala Phe Met Ser Ser Cys
50 55 60

Glu Val Ala Tyr Phe Ser Leu Lys Pro Ile Asp Leu Gln Asn Ile Arg
65 70 75 80

Glu Arg Asn His Ser Ser Asp Ile Ala Leu Ser Asn Leu Leu Asp Asn
85 90 95

Ser Asn Gln Leu Leu Ala Thr Ile Leu Ile Gly Asn Asn Val Ile Asn
100 105 110

Val Ala Ile Val Ile Leu Ser Asn Tyr Ala Ile Glu Gln Thr Phe Val
115 120 125

Phe Ser Ser Pro Ile Ile Gly Phe Leu Ile Gln Thr Ile Leu Leu Thr
130 135 140

Thr Val Leu Leu Leu Phe Gly Glu Ile Leu Pro Lys Val Tyr Ala Arg
145 150 155 160

Lys Asn Pro Leu Gln Tyr Ser Arg Phe Ser Ala Ala Ala Met Ser Val
165 170 175

Ile Tyr Lys Ile Leu Ser Pro Phe Ser Lys Leu Leu Val Lys Ser Thr
180 185 190

Gly Ile Val Thr Arg Gly Ile Ser Lys Lys Lys Tyr Asp Met Ser Val
195 200 205

Asp Glu Leu Ser Lys Ala Val Ala Leu Thr Thr Thr Glu Gly Glu Pro
210 215 220

Glu Glu Lys Glu Met Ile Asn Glu Ile Ile Lys Phe Tyr Asn Lys Thr
225 230 235 240

Ala Cys Glu Ile Met Val Pro Arg Ile Asp Ile Val Asp Val Asp Leu
245 250 255

Ser Trp Pro Phe Arg Lys Met Leu Asp Phe Val Val Ser Ser Gly Tyr
260 265 270

Ser Arg Leu Pro Val Ser Glu Gly Ser Glu Asp Asn Ile Lys Gly Val
275 280 285

Ile Tyr Ile Lys Asp Leu Ile Pro His Met Asp Lys Gly Asp Glu Phe
290 295 300

Asp Trp His Pro Leu Ile Arg Lys Ala Tyr Phe Val Pro Glu Asn Lys
305 310 315 320

Arg Ile Asp Asp Leu Leu Glu Glu Phe Arg Ala Asn Lys Val His Val
325 330 335

Ser Ile Val Val Asp Glu Phe Gly Gly Thr Cys Gly Leu Ile Thr Met
340 345 350

Glu Asp Ile Leu Glu Glu Ile Val Gly Glu Ile Thr Asp Glu Tyr Asp
355 360 365

Glu Glu Glu Leu Pro Phe Lys Val Leu Gly Asp Gly Ser Tyr Leu Phe
370 375 380

Glu Gly Lys Thr Ser Leu Ser Asp Val Arg His Tyr Leu Asp Leu Pro
385 390 395 400

Glu Asn Ala Phe Gly Glu Leu Gly Asp Glu Val Asp Thr Leu Ser Gly
405 410 415

Leu Phe Leu Glu Ile Lys Gln Glu Leu Pro His Val Gly Asp Thr Ala
420 425 430

Val Tyr Glu Pro Phe Arg Phe Gln Val Thr Gln Met Asp Lys Arg Arg
435 440 445

Ile Ile Glu Ile Lys Ile Phe Pro Phe Glu Arg Thr Trp Glu Val Glu
450 455 460



(2) INFORMATION FOR SEQ ID NO: 22 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 869 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...869

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22



Trp His Arg Asn He Phe He Phe Ala Ser Thr Phe Ser Pro Lys Asn 

^5 10 15 

Met Leu Pro Leu Pro Tyr Arg Tyr Ala Lys Thr Glu His Leu Phe Leu 
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20 25 30 

Ala Lys Gly Tyr Cys Lys Asn Pro lie Thr Asn lie lie lie Leu Phe 

35 40 45 

Met Lys Lys Lys Asn Phe Leu Leu Leu Gly He Phe Val Ala Leu Leu 
5 50 55 60 

Thr Phe He Gly Ser Met Gin Ala Gin Gin Ala Lys Asp Tyr Phe Asn 
65 70 75 80 

Phe Asp Glu Arg Gly Glu Ala Tyr Phe Ser Phe Lys Val Pro Asp Arg 
85 90 95 

10 Ala Val Leu Gin Glu Leu Ala Leu He Met Ser He Asp Glu Phe Asp 
100 105 HO 

Pro Val Thr Asn Glu Ala He Ala Tyr Ala Ser Glu Glu Glu Phe Glu 

115 120 125 

Ala Phe Leu Arg Tyr Gly Leu Lys Pro Thr Phe Leu Thr Pro Pro Ser 
15 130 135 140 

Met Gin Arg Ala Val Glu Met Phe Asp Tyr Arg Ser Gly Glu Lys Tyr 
145 150 155 160 

Glu Trp Asn Ala Tyr Pro Thr Tyr Glu Ala Tyr He Ser Met Met Glu 
165 170 175 

20 Glu Phe Gin Thr Lys Tyr Pro Ser Leu Cys Thr Thr Ser Val He Gly 
ISO 185 190 

Lys Ser Val Lys Asp Arg Lys Leu Met He Cys Lys Leu Thr Ser Ser 

195 200 205 

Ala Asn Thr Gly Lys Lys Pro Arg Val Leu Tyr Thr Ser Thr Met His 
25 210 215 220 

Gly Asp Glu Thr Thr Gly Tyr Val Val Leu Leu Arg Leu He Asp His 
225 230 235 240 

Leu Leu Ser Asn Tyr Glu Ser Asp Pro Arg He Lys Asn He Leu Asp 
245 250 255 

30 Lys Thr Glu Val Trp He Cys Pro Leu Thr Asn Pro Asp Gly Ala Tyr 
260 265 270 

Arg Ala Gly Asn His Thr Val Gin Gly Ala Thr Arg Tyr Asn Ala Asn 

275 280 285 

Asn Val Asp Leu Asn Arg Asn Phe Lys Asp Asp Val Ala Gly Asp His 
35 290 295 300 

Pro Asp Gly Lys Pro Trp Gin Pro Glu Ala Thr Ala Phe Met Asp Leu 
305 310 315 320 

Glu Gly Asn Thr Ser Phe Val Leu Gly Ala Asn He His Gly Gly Thr 
325 330 335 

Glu Val Val Asn Tyr Pro Trp Asp Asn Lys Lys Glu Arg His Ala Asp
340 345 350

Asp Glu Trp Tyr Lys Leu Ile Ser Arg Asn Tyr Ala Ala Ala Cys Gln
355 360 365

Ser Ile Ser Ala Ser Tyr Met Thr Ser Glu Thr Asn Ser Gly Ile Ile
370 375 380

Asn Gly Ser Asp Trp Tyr Val Ile Arg Gly Ser Arg Gln Asp Asn Ala
385 390 395 400

Asn Tyr Phe His Arg Leu Arg Glu Ile Thr Leu Glu Ile Ser Asn Thr
405 410 415

Lys Leu Val Pro Ala Ser Gln Leu Pro Lys Tyr Trp Asn Leu Asn Lys
420 425 430

Glu Ser Leu Leu Ala Leu Ile Glu Glu Ser Leu Tyr Gly Ile His Gly
435 440 445

Thr Val Thr Ser Ala Ala Asn Gly Gln Pro Leu Lys Cys Gln Ile Leu
450 455 460

Ile Glu Asn His Asp Lys Arg Asn Ser Asp Val Tyr Ser Asp Ala Thr
465 470 475 480

Thr Gly Tyr Tyr Val Arg Pro Ile Lys Ala Gly Thr Tyr Thr Val Lys
485 490 495

Tyr Lys Ala Glu Gly Tyr Pro Glu Ala Thr Arg Thr Ile Thr Ile Lys
500 505 510

Asp Lys Glu Thr Val Ile Met Asp Ile Ala Leu Gly Asn Ser Val Pro
515 520 525

Leu Pro Val Pro Asp Phe Thr Ala Ser Pro Met Thr Ile Ser Val Gly
530 535 540

Glu Ser Val Gln Phe Gln Asp Gln Thr Thr Asn Asn Pro Thr Asn Trp
545 550 555 560

Glu Trp Thr Phe Glu Gly Gly Gln Pro Ala Met Ser Thr Glu Gln Asn
565 570 575

Pro Leu Val Ser Tyr Ser His Pro Gly Gln Tyr Asp Val Thr Leu Lys
580 585 590

Val Trp Asn Ala Ser Gly Ser Asn Thr Ile Thr Lys Glu Lys Phe Ile
595 600 605

Thr Val Asn Ala Val Met Pro Val Ala Glu Phe Val Gly Thr Pro Thr
610 615 620

Glu Ile Glu Glu Gly Gln Thr Val Ser Phe Gln Asn Gln Ser Thr Asn
625 630 635 640

Ala Thr Asn Tyr Val Trp Ile Phe Asp Gly Gly Thr Pro Ala Thr Ser
645 650 655

Glu Asp Glu Asn Pro Thr Val Leu Tyr Ser Lys Ala Gly Gln Tyr Asp
660 665 670



Val Thr Leu Lys Ala Ile Ser Ala Ser Gly Glu Thr Val Lys Thr Lys
675 680 685

Glu Lys Tyr Ile Thr Val Lys Lys Ala Pro Val Pro Ala Pro Val Ala
690 695 700

Asp Phe Glu Gly Thr Pro Arg Lys Val Lys Lys Gly Glu Thr Val Thr
705 710 715 720

Phe Lys Asp Leu Ser Thr Asn Asn Pro Thr Ser Trp Leu Trp Val Phe
725 730 735

Glu Gly Gly Ser Pro Ala Thr Ser Thr Glu Gln Asn Pro Val Val Thr
740 745 750

Tyr Asn Glu Thr Gly Lys Tyr Asp Val Gln Leu Thr Ala Thr Asn Glu
755 760 765

Gly Gly Ser Asn Val Lys Lys Ala Glu Asp Tyr Ile Glu Val Ile Leu
770 775 780

Asp Hp Ser Val Glu Asp Ile Val Ala Gln Thr Gly Ile Val Ile Arg
785 790 795 800

Pro Gln Asn Gly Thr Lys Gln Ile Leu Ile Glu Ala Asn Ala Ala Ile
805 810 815

Lys Ala Ile Val Leu Tyr Asp Ile Asn Gly Arg Val Val Leu Lys Thr
820 825 830

Thr Pro Asn Gln Leu Arg Ser Thr Val Asp Leu Ser Ile Leu Pro Glu
835 840 845

Gly Ile Tyr Thr Ile Asn Ile Lys Thr Glu Lys Ser Ala Arg Thr Glu
850 855 860

Lys Ile His Ile Gly
865

(2) INFORMATION FOR SEQ ID NO: 23 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...843

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23 

Met Lys Lys Ser Phe Leu Leu Ala Ile Val Met Leu Phe Gly Ile Ala
1 5 10 15

Met Gln Gly His Ser Ala Pro Val Thr Lys Glu Arg Ala Leu Ser Leu
20 25 30

Ala Arg Leu Ala Leu Arg Gln Val Ser Leu Arg Met Gly Gln Thr Ala
35 40 45

Val Ser Asp Lys Ile Ser Ile Asp Tyr Val Tyr Arg Gln Gly Asp Ala
50 55 60

Glu Arg Gly Ile Thr Ser Gln Glu Glu Gly Ser Pro Ala Tyr Phe Tyr
65 70 75 80

Val Ala Asn Arg Gly Asn Asn Glu Gly Tyr Ala Leu Val Ala Ala Asp
85 90 95

Asp Arg Ile Pro Thr Ile Leu Ala Tyr Ser Pro Ile Gly Arg Phe Asp
100 105 110

Met Asp Ser Met Pro Asp Asn Leu Arg Met Trp Leu Gln Ile Tyr Asp
115 120 125

Gln Glu Ile Gly Leu Ile Leu Ser Gly Lys Ala Gln Leu Asn Glu Glu
130 135 140

Ile Leu Arg Thr Glu Gly Val Pro Ala Glu Val His Ala Leu Met Asp
145 150 155 160

Asn Gly His Phe Ala Asn Asp Pro Met Arg Trp Asn Gln Gly Tyr Pro
165 170 175

Trp Asn Asn Lys Glu Pro Leu Leu Pro Asn Gly Asn His Ala Tyr Thr
180 185 190

Gly Cys Val Ala Thr Ala Ala Ala Gln Ile Met Arg Tyr His Ser Trp
195 200 205

Pro Leu Gln Gly Glu Gly Ser Phe Asp Tyr His Ala Gly Ser Leu Val
210 215 220

Gly Asn Trp Ser Gly Thr Phe Gly Glu Met Tyr Asp Trp Ile Asn Met
225 230 235 240

Pro Gly Asn Pro Asp Leu Asp Asn Leu Thr Gln Ser Gln Val Asp Ala
245 250 255
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Tyr Ala Thr Leu Met Arg Asp Val Ser Ala Ser Val Ser Met Ser Phe 

260 265 270 

Tyr Glu Asn Gly Ser Gly Thr Tyr Ser Val Tyr Val Val Gly Ala Leu
275 280 285

Arg Asn Asn Phe Arg Tyr Lys Arg Ser Leu Gln Leu His Val Arg Ala
290 295 300

Leu Tyr Thr Ser Gln Glu Trp His Asp Met Ile Arg Gly Glu Leu Ala
305 310 315 320

Ser Gly Arg Pro Val Tyr Tyr Ala Gly Asn Asn Gln Ser Ile Gly His
325 330 335

Ala Phe Val Cys Asp Gly Tyr Ala Ser Asp Gly Thr Phe His Phe Asn
340 345 350

Trp Gly Trp Gly Gly Val Ser Asn Gly Phe Tyr Lys Leu Thr Leu Leu
355 360 365

Ser Pro Thr Ser Leu Gly Ile Gly Gly Glu Gly Ile Gly Phe Thr Ile
370 375 380

Tyr Gln Glu Ile Ile Thr Gly Ile Glu Pro Ala Lys Thr Pro Ala Glu
385 390 395 400

Ala Gly Thr Asp Ala Leu Pro Ile Leu Ala Leu Lys Asp Ile Glu Ala
405 410 415

Glu Tyr Lys Ser Glu Ser Gly Leu Asn Val Gly Tyr Ser Ile Tyr Asn
420 425 430

Thr Gly Glu Glu Gln Ser Asn Leu Asp Leu Gly Tyr Arg Leu Asn Lys
435 440 445

Ala Asp Gly Glu Val Ile Glu Val Lys Thr Ser Ser Ile Asn Ile Ser
450 455 460

Trp Tyr Gly Tyr Gly Glu His Pro Glu Ser Phe Ser Leu Ala Pro Asn
465 470 475 480

Gln Leu Ser Gln Gly Ile Asn Thr Ile Thr Leu Leu Tyr Arg Arg Thr
485 490 495

Gly Thr Glu Gln Trp Glu Pro Val Arg His Ala Gln Gly Gly Tyr Val
500 505 510

Asn Ser Ile Lys Val Asn Thr Thr Asp Pro Asn Asn Val Val Val Thr
515 520 525

Val Asp Asn Asn Glu Gly Lys Leu Ser Ile Val Pro Asn Ser Phe Val
530 535 540

Ala Asp Leu Asn Ser Tyr Glu His Ser Thr Ile Thr Val Gln Phe Asn
545 550 555 560

Ser Asp Ser Pro Asp Glu Ile Arg Thr Pro Val Ala Phe Ala Leu Ser
565 570 575

Thr Gly Ala Thr Ala Asp Asp Val Ile Ser Leu Gly Trp Val Met Ala
580 585 590

Glu Val Pro Gly Gly Ser Ser Asn Tyr Pro Val Val Trp Ser Lys Asp
595 600 605

Val Leu Thr Leu Ser Glu Gly Asp Tyr Thr Leu Trp Tyr Arg Phe Ser
610 615 620

Ile Asn Asn Gln Lys Asp Glu Trp Lys Lys Ile Gly Ser Val Ser Val
625 630 635 640

Lys Thr Pro Thr Glu Tyr Thr His Pro Leu Phe Glu Val Gly His Asn
645 650 655

Gln Thr Ser Thr Tyr Thr Leu Asp Met Ala His Asn Arg Val Leu Pro
660 665 670

Asp Phe Thr Leu Lys Asn Leu Gly Leu Pro Phe Asn Gly Glu Leu Val
675 680 685

Val Val Phe Arg Gln Thr Gln Ser Ser Ser Gly Ser Leu Trp Ala Ala
690 695 700

Gln Glu Thr Val His Ile Lys Gln Gly Glu Thr Phe Val Tyr Lys Pro
705 710 715 720

Val Val Glu Gly Pro Ile Pro Asp Gly Ser Tyr Arg Ala Thr Leu His
725 730 735

Ala Phe Val Asn Gly Gln Gln Gln Leu Tyr Leu Lys Gly Lys Arg Asn
740 745 750

Tyr Thr Val Lys Ile Val Asn Gly Thr Ala Val Glu Ala Ile Glu Ser
755 760 765

Ser Glu Glu Ile Arg Val Phe Pro Asn Pro Ala Arg Asp Tyr Val Glu
770 775 780

Ile Ser Ala Pro Cys Ile Pro Gln Glu Thr Ser Ile Ile Leu Phe Asp
785 790 795 800

Leu Ser Gly Lys Ile Val Met Lys Asn Ser Leu Ser Ala Gly His Gly
805 810 815

Arg Met Asp Val Ser Arg Leu Pro Asn Gly Ala Tyr Ile Leu Lys Val
820 825 830

Asp Gly Tyr Thr Thr Lys Ile Asn Ile Val His
835 840



(2) INFORMATION FOR SEQ ID NO: 24 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1175 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...1175

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24 

Met Thr Trp Leu Ile Asp Phe Pro Ala Leu Lys Lys Leu Asp Leu Ser
1 5 10 15

Tyr Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu
20 25 30

Thr Lys Leu Arg Leu Arg Ser Asn Gln Ile Arg Lys Leu Glu Gly Leu
35 40 45

Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gln Ile
50 55 60

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr
65 70 75 80

Leu Leu Asp Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr
85 90 95

Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn Gln Ile Arg Lys Leu Glu
100 105 110

Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn
115 120 125

Gln Ile Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys
130 135 140

Leu Arg Leu Arg Ser Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg
145 150 155 160

Leu Thr Ser Leu Ala Thr Leu Glu Leu Ser Gly Asn Gln Ile Arg Lys
165 170 175

Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu Glu Leu Ser
180 185 190

Gly Asn Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Ser Ser Leu
195 200 205

Thr Lys Leu Arg Leu Arg Ser Asn Gln Ile Ser Lys Leu Glu Gly Leu
210 215 220

Glu Arg Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gln Ile
225 230 235 240

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr
245 250 255

Leu Leu Asp Asn Gln Ile Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr
260 265 270

Ser Leu Thr Lys Leu Arg Leu Arg Ser Asn Gln Ile Ser Lys Leu Glu
275 280 285

Gly Leu Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn
290 295 300

Gln Ile Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu
305 310 315 320

Leu Tyr Leu Leu Asp Asn Gln Ile Arg Lys Leu Glu Gly Leu Asp Gly
325 330 335

Leu Ala Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gln Ile Ser Lys
340 345 350

Leu Glu Gly Leu Asp Arg Leu Lys Val Leu Arg Lys Leu Asp Val Ser
355 360 365

Gly Asn Asp Ile Gln Ser Ile Asp Asp Ile Lys Leu Leu Ala Pro Ile
370 375 380

Leu Glu Gln Thr Leu Glu Lys Leu Arg Ile His Asp Asn Pro Phe Val
385 390 395 400

Ala Ser Ser Gly Leu Ile Leu Ser Pro Tyr Asp Asn His Leu Pro Glu
405 410 415

Ile Lys Ala Leu Leu Glu Lys Glu Lys Glu Lys Gln Lys Lys Thr Ser
420 425 430

Val Glu Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Asn His Ser
435 440 445

Ser Gly Lys Thr Thr Phe Leu Ser Gln Tyr Asp Thr Asn Tyr Thr Tyr
450 455 460

Gln Lys Asn Thr His Val Leu Ser Ile His Arg Ser Asn Asn Pro Asn
465 470 475 480

Ala Ile Phe Tyr Asp Phe Gly Gly Gln Asp Tyr Tyr His Gly Ile Tyr
485 490 495

Gln Ala Phe Phe Thr Thr Gln Ser Leu Tyr Leu Leu Phe Trp Asp Ala
500 505 510

Lys Lys Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gln Thr
515 520 525

Leu Asn Phe Asn Arg Pro Tyr Trp Leu Gly Gln Ile Ala Tyr Ala Cys
530 535 540

Asn Arg Cys Met Ser Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro
545 550 555 560

Gln Thr Thr Asp Asp Thr Ile Ile Ile Gln Thr His Ala Asp Glu Thr
565 570 575

Gly Ala Lys Gln Gln Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu
580 585 590

Glu Glu Ile Tyr Val Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His
595 600 605

Ala Leu Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arg
610 615 620

Ser Lys Ser Ile Gln Ile Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala
625 630 635 640

Leu Pro Thr Ile Ala Gly Asp Asn Lys His Ile Pro Ile Ser Leu Glu
645 650 655

Ala Leu Ala Ala Gln Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr
660 665 670

Thr Ile Glu Tyr Leu Gln Thr Glu Leu Asn Gln Leu Ser Leu Arg Gly
675 680 685

Glu Val Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp
690 695 700

Leu Asp Pro Ala Ala Phe Val Gln Met Ile His Gly Glu Ile Leu Gln
705 710 715 720

Lys Asp Asn Ile Asn Arg Gly Thr Val Pro Lys Asp Ile Phe Glu Cys
725 730 735

Lys Leu His Asn Leu Ser Ser Gly Ser Ile Phe Glu Glu Asp Gly Gln
740 745 750

Asn Gly Asn Met Ile Leu Gln Leu Leu Leu Glu Glu Leu Ile Val Tyr
755 760 765

Glu Asp Lys Asp Cys Tyr Val Ile Pro Gly Tyr Leu Pro Leu His Ser
770 775 780

Asp Asp Glu Ala Tyr Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asn
785 790 795 800

Phe Val Leu Lys Phe Glu Arg Phe Ile Pro Phe Gly Leu Ile Asn Gln
805 810 815

Ile Ile Ala Tyr Tyr Gly Arg Glu Glu Gly Ala Leu Lys Arg Tyr Trp
820 825 830

Arg Asp Gln Val Ile Phe Thr Ala Gly Arg Glu Met Asp Arg Gln Thr
835 840 845

Leu Glu Gln Glu Glu Glu Lys Glu Gly Leu Pro Lys Thr Asn Ala Glu
850 855 860

Asp Tyr Gln Ile Trp Ile Lys Leu Asp Phe Thr Asp Leu Ala Ile Ser
865 870 875 880

Val Phe Ile Lys Glu Gln Arg Lys Thr Ser Ala Lys Asp Met Gln Arg
885 890 895

Lys Glu Ala Thr Ile Leu Ser Asp Met Leu Asp Met Tyr Trp Asn Asn
900 905 910

Ile Pro Pro Arg Glu Gln Ile Gly Asp Lys Asp Thr Glu Gln Thr Arg
915 920 925

Ser Thr Ile Arg Glu Thr Asn Arg Lys Lys Arg Pro Ile Gln Asp Leu
930 935 940

Tyr Leu Ser Cys Ala Gln Ala Asp Lys Asp Leu Thr Glu Ser His Tyr
945 950 955 960

Ile His Leu Gly Thr Leu Asp Asp Glu Ser Lys Thr Thr Ala Arg Ile
965 970 975

Ala Ala Tyr Pro Leu Lys Asn Gly Val Ile Asp Lys Glu Arg Val Arg
980 985 990

Glu Val Ser Thr Arg Pro Tyr Lys His Leu Ser Val Asn Lys Asn Leu
995 1000 1005

Ala Thr Ala Lys Gln Ile Phe Ile Ser Tyr Ser Lys Glu Asp Gln Thr
1010 1015 1020

Glu Leu Glu Thr Cys Leu Gln Phe Phe Lys Pro Leu Glu Lys Asn Gly
1025 1030 1035 1040

Gln Ile Glu Ile Tyr Tyr Asp Lys Leu Thr Lys Phe Glu Thr Pro Ile
1045 1050 1055

His Pro Glu Ile Arg Lys Arg Ile Val Glu Ala Asp Cys Ile Ile Ala
1060 1065 1070

Leu Ile Ser Gln Arg Tyr Leu Ala Thr Asp Tyr Ile Leu Asp His Glu
1075 1080 1085

Leu Pro Val Phe Arg Glu Tyr Asn Lys Thr Ile Val Pro Ile Leu Ile
1090 1095 1100

Lys Pro Cys Thr Phe Glu Asp Asp Glu Phe Leu Arg Glu Lys Tyr Phe
1105 1110 1115 1120

Ala Gln Lys Ala Gln Ile Ile Asn Leu Gly Lys Glu Gly Lys Thr Ile
1125 1130 1135

Lys Ala Tyr Asp Ser Ile Thr Ala Ser Ala His Arg Asp Glu Asn Trp
1140 1145 1150

Val Ala Val Val Arg Glu Phe Lys Glu Lys Ile Leu Arg Ile Thr Lys
1155 1160 1165

Gln Glu Val Asn Thr Asp Glu
1170 1175

(2) INFORMATION FOR SEQ ID NO: 25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...312

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 

Met Ile Pro Leu Ser Glu Ser Phe Glu Ser Gly Ile Pro Ala Ile Trp
1 5 10 15

Lys Thr Ile Asp Ala Asp Gly Asp Gly Tyr Asn Trp Met His Leu Thr
20 25 30

Asn Phe Thr Gly Gln Ser Gly Leu Cys Val Ser Ser Ala Ser Tyr Ile
35 40 45

Gly Gly Val Gly Ala Leu Thr Pro Asp Asn Tyr Leu Ile Thr Pro Glu
50 55 60

Leu Lys Leu Pro Thr Asp Ala Leu Val Glu Ile Ile Tyr Trp Val Cys
65 70 75 80

Thr Gln Asp Leu Thr Ala Pro Ser Glu His Tyr Ala Val Tyr Ser Ser
85 90 95

Ser Thr Gly Asn Asn Ala Ala Asp Phe Val Asn Leu Leu Tyr Glu Glu
100 105 110

Thr Leu Thr Ala Lys Arg Ile Gln Ser Pro Glu Leu Ile Arg Gly Asn
115 120 125

Arg Thr Gln Gly Val Trp Tyr Gln Arg Lys Val Val Leu Pro Asn Asp
130 135 140

Thr Lys Tyr Val Ala Phe Arg His Phe Asn Ser Thr Asp Asn Phe Trp
145 150 155 160

Leu Asn Leu Asp Glu Val Ser Ile Leu Tyr Thr Pro Leu Pro Arg Arg
165 170 175

Ala Pro Cys Pro His Pro Gly Gly Tyr Thr Tyr Ser Val Phe Arg Asp
180 185 190

Gly Gln Lys Ile Ala Ser Gly Leu Ser Ala Leu Ala Tyr Ile Asp Thr
195 200 205

Asp Val Pro Tyr Gly Thr Gln Asp Tyr Cys Val Gln Val Asn Tyr Leu
210 215 220

Gln Gly Asp Ser Tyr Lys Val Cys Lys Asn Ile Val Val Ala Asn Ser
225 230 235 240

Ala Asn Ile Tyr Gly Ala Asp Lys Pro Phe Ala Leu Thr Val Val Gly
245 250 255

Lys Thr Ile Val Ala Ser Ala Phe Lys Gly Glu Ile Thr Leu Tyr Asp
260 265 270

Ile Arg Gly Arg Leu Ile Ala Ser Gly Cys Asp Thr Leu Arg Tyr Lys
275 280 285

Ala Glu Asn Gly Phe Tyr Leu Ile Lys Ile Gln Val Asn Gly Thr Val
290 295 300

Tyr Thr Glu Lys Ile Gln Ile Gln
305 310

(2) INFORMATION FOR SEQ ID NO: 26 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE:
(A) NAME/KEY: misc_feature

(B) LOCATION 1...757

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

Met Arg Thr Lys Thr Ile Phe Phe Ala Ile Ile Ser Phe Ile Ala Leu
1 5 10 15

Leu Ser Ser Ser Leu Ser Ala Gln Ser Lys Ala Val Leu Thr Gly Ser
20 25 30

Val Ser Asp Ala Glu Thr Gly Glu Pro Leu Ala Gly Ala Arg Ile Glu
35 40 45

Val Lys His Thr Asn Ile Val Ala Gly Ala Asp Ala Gly Gly His Phe
50 55 60

Glu Ile Lys Asn Leu Pro Ala Gly Gln His Thr Ile Ile Cys Ser Leu
65 70 75 80

Gly Gly Tyr Gly Gln Lys Glu Glu Val Val Ala Ile Glu Ala Gly Gln
85 90 95

Thr Lys Thr Ile Ser Phe Ala Leu Arg Leu Arg Thr Asn Asn Leu Glu
100 105 110

Glu Val Val Val Thr Gly Thr Gly Thr Arg Tyr Arg Leu Val Asp Ala
115 120 125

Pro Val Ala Thr Glu Val Leu Thr Ala Lys Asp Ile Ala Ser Phe Ser
130 135 140

Ala Pro Thr Ser Glu Ala Leu Leu Gln Gly Leu Ser Pro Ser Phe Asp
145 150 155 160

Phe Gly Pro Asn Leu Met Gly Ser Phe Met Gln Leu Asn Gly Leu Ser
165 170 175

Ser Lys Tyr Ile Leu Ile Leu Ile Asp Gly Lys Arg Val Tyr Gly Asp
180 185 190

Val Gly Gly Gln Ala Asp Leu Ser Arg Ile Ser Pro Asp Gln Ile Glu
195 200 205

Arg Ile Glu Leu Val Lys Gly Ala Ser Ser Ser Leu Tyr Gly Ser Asp
210 215 220

Ala Ile Ala Gly Val Ile Asn Val Ile Thr Lys Lys Asn Thr Asn Arg
225 230 235 240

Leu Ser Ala Tyr Thr Ser His Arg Ile Ser Lys Tyr Asn Asp Arg Gln
245 250 255

Thr Asn Thr Ser Leu Asp Ile Asn Ile Gly Lys Phe Ser Ser Asn Thr
260 265 270

Asn Tyr Phe Phe Tyr His Thr Asp Gly Trp Gln Asn Ser Pro Phe Glu
275 280 285

Ile Lys Lys Lys Lys Gly Ser Gly Glu Pro Val Leu Glu Glu Thr Tyr
290 295 300

Lys Lys Thr Phe Arg Ala Gln Glu Asn Gln Gly Val Ser Gln Ser Leu
305 310 315 320

Ser Tyr Tyr Ala Thr Asn Asn Leu Ser Phe Ser Gly Asn Val Gln Tyr
325 330 335

Asn Lys Arg Gln Ile Phe Thr Pro Thr Phe Ser Glu Lys Lys Ala Tyr
340 345 350

Asp Met Asp Tyr Arg Ala Leu Thr Ala Ser Leu Gly Thr Asn Tyr Leu
355 360 365

Phe Pro Asn Gly Leu His Thr Leu Ser Phe Asp Ala Val Tyr Asp Arg
370 375 380

Phe Arg Phe Gly Tyr Leu Tyr His Asp Lys Asp Ser Ser Glu Ser Leu
385 390 395 400

Ile Asn Asn Gln Gly Gln Thr Glu Gln Pro Thr Phe Phe Pro Gly Gln
405 410 415

Leu Arg Asn Lys Asn Asp Gln Ile Arg Tyr Thr Ala Glu Ala Arg Gly
420 425 430

Val Phe Thr Leu Pro Tyr Ala Gln Lys Leu Thr Gly Gly Leu Glu Tyr
435 440 445

Phe Arg Glu Glu Leu Ile Ser Pro Tyr Asn Leu Ile Thr Asp Lys Ala
450 455 460

Asp Ala Ser Thr Leu Ser Ala Tyr Val Gln Asp Glu Trp Lys Pro Leu
465 470 475 480

Asp Trp Phe Asn Met Thr Ala Gly Phe Arg Leu Val His His Gln Glu
485 490 495

Phe Gly Thr Arg Met Thr Pro Lys Val Ser Ile Leu Ala Lys Tyr Gly
500 505 510

Pro Leu Asn Phe Arg Ala Thr Tyr Ala Asn Gly Tyr Lys Thr Pro Thr
515 520 525

Leu Lys Glu Leu Phe Ala Arg Asn Glu Leu Thr Thr Met Gly Ser His
530 535 540

Asn Leu Tyr Leu Gly Asn Ala Asp Leu Lys Pro Gln Met Ser Asp Tyr
545 550 555 560

Tyr Ala Leu Gly Leu Glu Tyr Asn Gln Gly Pro Ile Ser Phe Ser Ala
565 570 575

Thr Val Tyr Asp Asn Glu Leu Arg Asn Leu Ile Ser Phe Met Asp Ile
580 585 590

Pro Thr Ser Pro Glu His Glu Ala Gln Gly Ile Lys Lys Thr Lys Gln
595 600 605

Tyr Ala Asn Ile Gly Lys Ala Arg Ser Arg Gly Leu Asp Val Leu Cys
610 615 620

Asp Ala Ser Ile Gly Trp Gly Ile Lys Leu Gly Ala Gly Tyr Ser Leu
625 630 635 640

Val Glu Ala Lys Asn Leu Gln Thr Asp Glu Trp Leu Glu Gly Ala Ala
645 650 655

Arg His Arg Ala Asn Val His Ala Asp Trp Val His Tyr Trp Gly Gln
660 665 670

Tyr Arg Leu Gly Val Ser Leu Phe Gly Arg Ile Gln Ser Glu Arg Tyr
675 680 685

Tyr Lys Asp Gly Asn Ala Pro Asp Tyr Thr Leu Trp Arg Leu Ala Thr
690 695 700

Ser His Arg Phe Ala His Phe Arg His Ile Ile Leu Asp Gly Thr Leu
705 710 715 720

Gly Ile Asp Asn Leu Phe Asp Tyr Val Asp Asp Arg Pro Met Gly Val
725 730 735

Asn Tyr Ala Thr Val Thr Pro Gly Arg Thr Phe Phe Ala Gln Ile Ala
740 745 750

Ile Arg Phe Asn Asn
755

(2) INFORMATION FOR SEQ ID NO: 27 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 569 amino acids

(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...569

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27



Met Lys Lys Thr Asn Leu Phe Leu Ser Leu Leu Val Ile Phe Ile Thr
1 5 10 15

Gly Ser Phe Met Thr Ala Cys Ala Gln Lys Ser Lys Thr Asn Lys Leu
20 25 30

Thr Glu Glu Asp Arg Ser Arg Asn Glu Tyr Val Gln Ser Met Asp Val
35 40 45

Leu Ser Asn Ile Ile Gly Asn Val Arg Leu Tyr Phe Val Asp Thr Ile
50 55 60

Ser Ile Lys His Met Thr Arg Arg Gly Ile Asp Ala Met Leu Gly Gly
65 70 75 80

Leu Asp Pro Tyr Thr Glu Tyr Ile Pro Tyr Glu Glu Met Asp Glu Leu
85 90 95

Lys Leu Met Thr Thr Gly Glu Tyr Ala Gly Val Gly Ala Ile Ile Ser
100 105 110

Gln Arg Pro Asp Ser Ala Val Ile Ile Gln Arg Pro Met Glu Gly Met
115 120 125

Pro Ala Asp Glu Ala Gly Leu Ile Ala Gly Asp Arg Ile Leu Thr Ile
130 135 140

Asp Gly Lys Asp Phe Arg Lys Ser Thr Thr Pro Lys Val Ser Gln Ala
145 150 155 160

Leu Lys Gly Ile Ala Gly Thr Val Ala Lys Val Thr Val Met Arg Tyr
165 170 175

Gly Glu Thr Lys Pro Arg Thr Phe Ser Val Lys Arg Gln Lys Val Ile
180 185 190

Met Asn Ser Val Thr Tyr Ser Gly Met Leu Asp Gly Ser Ile Gly Tyr
195 200 205

Ile Arg Leu Asn Asn Phe Thr Asp Lys Ser Ala Glu Glu Val Arg Thr
210 215 220

Ala Leu Leu Asp Leu Arg Asp Lys Gln Gly Ala Lys Gly Leu Ile Leu
225 230 235 240

Asp Leu Arg Gly Asn Gly Gly Gly Leu Met Gln Ala Ala Ile Glu Ile
245 250 255

Val Asn Leu Phe Val Pro Lys Gly Lys Glu Val Val Thr Thr Lys Gly
260 265 270

Arg Ile Ala Glu Ser Ala Ser Val Phe Arg Thr Leu Thr Glu Pro Ile
275 280 285

Asp Thr Lys Leu Pro Ile Val Val Leu Ile Asp Gly Gln Ser Ala Ser
290 295 300

Ser Ser Glu Ile Val Ala Gly Ala Leu Gln Asp Met Asp Arg Ala Val
305 310 315 320

Leu Met Gly Gln Lys Ser Tyr Gly Lys Gly Leu Val Gln Thr Thr Arg
325 330 335

Gln Leu Pro Tyr Asn Gly Val Ile Lys Leu Thr Thr Ala Lys Tyr Tyr
340 345 350

Ile Pro Ser Gly Arg Cys Ile Gln Arg Leu Asp Tyr Ser Arg Thr Asn
355 360 365

Arg Thr Gly Met Ala Thr Ala Ile Pro Asp Ser Leu His Lys Ile Phe
370 375 380

Tyr Thr Ala Ala Gly Arg Arg Val Glu Asp Ala Gly Gly Ile Leu Pro
385 390 395 400

Asp Ile Glu Val Lys Gln Asp Thr Ala Ala Thr Leu Leu Tyr Tyr Met
405 410 415

Ala Ile Asn Asn Asp Val Phe Asp Phe Val Thr Gly Tyr Val Leu Lys
420 425 430

His Lys Thr Ile Ala Lys Pro Glu Asp Phe Ser Ile Thr Asn Glu Asp
435 440 445

Tyr Ala Ala Phe Cys Lys Met Met Glu Glu Lys Lys Phe Asp Tyr Asp
450 455 460

Arg Gln Ser Gly Lys Met Leu Asp Lys Leu Glu Glu Leu Ala Lys Ile
465 470 475 480

Glu Gly Tyr Leu Pro Glu Ala Asn Ser Glu Leu Lys Ala Leu Arg Glu
485 490 495

Lys Leu Lys Pro Asn Leu Ser Arg Asp Leu Leu Arg Phe Lys Lys Glu
500 505 510

Ile Thr Asn Tyr Leu Asn Asn Glu Ile Val Thr Arg Tyr Tyr Tyr Glu
515 520 525

Arg Gly Ser Ile Arg Gln Ser Leu Pro Glu Asp Lys Val Val Lys Glu
530 535 540

Ala Ile Lys Leu Leu Lys Asp His Pro Glu Gln Ile Arg Gln Ile Leu
545 550 555 560

Ala Ala Pro Lys Ala Glu Asn Lys Gly
565



(2) INFORMATION FOR SEQ ID NO: 28

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...290



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

Met Lys Lys Leu Phe Leu Ser Leu Thr Ser Leu Val Met Val Phe Ala
1 5 10 15

Val Ala Ser Cys Asp Ile Ile Asp Lys Asp Gln Thr Leu Leu Pro Ala
20 25 30

Pro Thr Asn Val Thr Pro Asp Asn Pro Asp Asp Asn Pro Ser Glu Ile
35 40 45

Asp Ile Thr Gln Thr His Thr Glu Lys Tyr Val Leu Ala Glu Glu Phe
50 55 60

Thr Gly Gln Lys Cys Leu Asn Cys Pro Lys Gly His Arg Lys Leu Ala
65 70 75 80

Ala Leu Lys Glu Gln Tyr Gly Lys Arg Leu Thr Val Val Gly Ile His
85 90 95

Ala Gly Pro Gly Ser Leu Val Pro Pro Leu Phe Arg Thr Glu Ala Gly
100 105 110

Asp Ala Tyr Tyr Ser Lys Phe Ala Asn Asn Thr Pro Leu Pro Ala Leu
115 120 125

Met Val Ser Arg Lys Lys Phe Gly Ser Ser Tyr Val Tyr Asp Lys Ser
130 135 140

Tyr Lys Thr Trp Asp Val Pro Ile Ala Glu Gln Met Glu Gln Lys Ala
145 150 155 160

Lys Ile Asn Ile Phe Ala Val Ala Glu Tyr Thr Asp Thr Gln Lys Ile
165 170 175

Lys Val Thr Val Lys Gly Lys Ile Leu Glu Gly Asn Thr Leu Pro Lys
180 185 190

Ser Met Val Gln Val Tyr Leu Leu Glu Asp Lys Leu Ile Ala Pro Gln
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195 200 205 

Val Asp Gly Asn Thr Thr Val Glu Asn Tyr Glu His Asn His Val Leu

210 215 220

Gly Ala Val Asn Gly Ile Trp Gly Glu Glu Phe Val Asn Leu Lys



Arg 

225 230 235 240

Asp Tyr Leu Tyr Thr Tyr Ala Val Glu Pro Leu Ser Gly Met Ser Phe
245 250 255

Val Ala Glu Asn Tyr Ser Ile Val Ala Phe Val Tyr Asp Val Gln Thr
260 265 270

Phe Glu Val Tyr Asp Val Val His Val Lys Ile Asn Pro Gln Ser Asp
275 280 285

Gly Lys
290

(2) INFORMATION FOR SEQ ID NO: 29

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 
(A) NAME/KEY: misc_feature

(B) LOCATION 1...1017

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 

Met Lys Arg Met Thr Leu Phe Phe Leu Cys Leu Leu Thr Ser Ile Gly
1 5 10 15

Trp Ala Met Ala Gln Asn Arg Thr Val Lys Gly Thr Val Ile Ser Ser
20 25 30

Asp Asn Glu Pro Leu Ile Gly Ala Asn Val Val Val Val Gly Asn



Glu 



35 40 45



Thr Thr Ile Gly Ala Ala Thr Asp Leu Asp Gly Asn Phe Thr Leu Ser
50 55 60

Val Pro Ala Asn Ala Lys Met Leu Arg Val Ser Tyr Ser Gly Met Thr
65 70 75 80

Thr Lys Glu Val Ala Ile Ala Asn Val Met Lys Ile Val Leu Asp Pro
85 90 95

Asp Ser Lys Val Leu Glu Gln Val Val Val Leu Gly Tyr Gly Thr Gly
100 105 110

Gln Lys Leu Ser Thr Val Ser Gly Ser Val Ala Lys Val Ser Ser Glu
115 120 125

Lys Leu Ala Glu Lys Pro Val Ala Asn Ile Met Asp Ala Leu Gln Gly
130 135 140

Gln Val Ala Gly Met Gln Val Met Thr Thr Ser Gly Asp Pro Thr Ala
145 150 155 160

Val Ala Ser Val Glu Ile His Gly Thr Gly Ser Leu Gly Ala Ser Ser
165 170 175

Ala Pro Leu Tyr Ile Val Asp Gly Met Gln Thr Ser Leu Asp Val Val
180 185 190

Ala Thr Met Asn Pro Asn Asp Phe Glu Ser Met Ser Val Leu Lys Asp
195 200 205

Ala Ser Ala Thr Ser Ile Tyr Gly Ala Arg Ala Ala Asn Gly Val Val
210 215 220

Phe Ile Gln Thr Lys Lys Gly Lys Met Ser Glu Arg Gly Arg Ile Thr
225 230 235 240

Phe Asn Ala Ser Tyr Gly Ile Ser Gln Ile Leu Asn Thr Lys Pro Leu
245 250 255

Asp Asn Met Met Thr Gly Asp Glu Leu Leu Asp Phe Gln Val Lys Ala
260 265 270

Gly Phe Trp Gly Asn Asn Gln Thr Val Gln Lys Val Lys Asp Met Ile
275 280 285

Leu Ala Gly Ala Glu Asp Leu Tyr Gly Asn Tyr Asp Ser Leu Lys Asp
290 295 300

Glu Tyr Gly Lys Thr Leu Phe Pro Val Asp Phe Asn His Asp Ala Asp
305 310 315 320

Trp Leu Lys Ala Leu Phe Lys Thr Ala Pro Thr Ser Gln Gly Asp Ile
325 330 335

Ser Phe Ser Gly Gly Ser Gln Gly Thr Ser Tyr Tyr Ala Ser Ile Gly
340 345 350
Tyr Phe Asp Gln Glu Gly Met Ala Arg Glu Pro Ala Asn Phe Lys Arg
355 360 365

Tyr Ser Gly Arg Leu Asn Phe Glu Ser Arg Ile Asn Glu Trp Leu Lys
370 375 380

Val Gly Ala Asn Leu Ser Gly Ala Ile Ala Asn Arg Arg Ser Ala Asp
385 390 395 400

Tyr Phe Gly Lys Tyr Tyr Met Gly Ser Gly Thr Phe Gly Val Leu Thr
405 410 415

Met Pro Arg Tyr Tyr Asn Pro Phe Asp Val Asn Gly Asp Leu Ala Asp
420 425 430

Val Tyr Tyr Met Tyr Gly Ala Thr Arg Pro Ser Met Thr Glu Pro Tyr
435 440 445

Phe Ala Lys Met Arg Pro Phe Ser Ser Glu Ser His Gln Ala Asn Val
450 455 460

Asn Gly Phe Ala Gln Ile Thr Pro Ile Lys Gly Leu Thr Leu Lys Ala
465 470 475 480

Gln Ala Gly Val Asp Ile Thr Asn Thr Arg Thr Ser Ser Lys Arg Met
485 490 495

Pro Asn Asn Pro Tyr Asp Ser Thr Pro Leu Gly Glu Arg Arg Glu Arg
500 505 510

Ala Tyr Arg Asp Val Ser Lys Ser Phe Thr Asn Thr Ala Glu Tyr Lys
515 520 525

Phe Ser Ile Asp Glu Lys His Asp Leu Thr Ala Leu Met Gly His Glu
530 535 540

Tyr Ile Glu Tyr Glu Gly Asp Val Ile Gly Ala Ser Ser Lys Gly Phe
545 550 555 560

Glu Ser Asp Lys Leu Met Leu Leu Ser Gln Gly Lys Thr Gly Asn Ser
565 570 575

Leu Ser Leu Pro Glu His Arg Val Ala Glu Tyr Ala Tyr Leu Ser Phe
580 585 590

Phe Ser Arg Phe Asn Tyr Gly Phe Asp Lys Trp Met Tyr Ile Asp Phe
595 600 605

Ser Val Arg Asn Asp Gln Ser Ser Arg Phe Gly Ser Asn Asn Arg Ser
610 615 620

Ala Trp Phe Tyr Ser Val Gly Gly Met Phe Asp Ile Tyr Asn Lys Phe
625 630 635 640

Ile Gln Glu Ser Asn Trp Leu Ser Asp Leu Arg Leu Lys Met Ser Tyr
645 650 655

Gly Thr Thr Gly Asn Ser Glu Ile Gly Asn Tyr Asn His Gln Ala Leu
660 665 670

Val Thr Val Asn Asn Tyr Thr Glu Asp Ala Met Gly Leu Ser Ile Ser
675 680 685

Thr Ala Gly Asn Pro Asp Leu Ser Trp Glu Lys Gln Ser Gln Phe Asn
690 695 700

Phe Gly Leu Ala Ala Gly Ala Phe Asn Asn Arg Leu Ser Ala Glu Val
705 710 715 720

Asp Phe Tyr Val Arg Thr Thr Asn Asp Met Leu Ile Asp Val Pro Met
725 730 735

Pro Tyr Ile Ser Gly Phe Phe Ser Gln Tyr Gln Asn Val Gly Ser Met
740 745 750

Lys Asn Thr Gly Val Asp Leu Ser Leu Lys Gly Thr Ile Tyr Gln Asn
755 760 765

Lys Asp Trp Asn Val Tyr Ala Ser Ala Asn Phe Asn Tyr Asn Arg Gln
770 775 780

Glu Ile Thr Lys Leu Phe Phe Gly Leu Asn Lys Tyr Met Leu Pro Asn
785 790 795 800

Thr Gly Thr Ile Trp Glu Ile Gly Tyr Pro Asn Ser Phe Tyr Met Ala
805 810 815

Glu Tyr Ala Gly Ile Asp Lys Lys Thr Gly Lys Gln Leu Trp Tyr Val
820 825 830

Pro Gly Gln Val Asp Ala Asp Gly Asn Lys Val Thr Thr Ser Gln Tyr
835 840 845

Ser Ala Asp Leu Glu Thr Arg Ile Asp Lys Ser Val Thr Pro Pro Ile
850 855 860

Thr Gly Gly Phe Ser Leu Gly Ala Ser Trp Lys Gly Leu Ser Leu Asp
865 870 875 880

Ala Asp Phe Ala Tyr Ile Val Gly Lys Trp Met Ile Asn Asn Asp Arg
885 890 895

Tyr Phe Thr Glu Asn Unk Gly Gly Leu Met Gln Leu Asn Lys Asp Lys
900 905 910

Met Leu Leu Asn Ala Trp Thr Glu Asp Asn Lys Glu Thr Asp Val Pro
915 920 925

Lys Leu Gly Gln Ser Pro Gln Phe Asp Thr His Leu Leu Glu Asn Ala
930 935 940

Ser Phe Leu Arg Leu Lys Asn Leu Lys Leu Thr Tyr Val Leu Pro Asn
945 950 955 960

Ser Leu Phe Ala Gly Gln Asn Val Ile Gly Gly Ala Arg Val Tyr Leu
965 970 975

Met Ala Arg Asn Leu Leu Thr Val Thr Lys Tyr Lys Gly Phe Asp Pro
980 985 990

Glu Ala Gly Gly Asn Val Gly Lys Asn Gln Tyr Pro Asn Ser Lys Gln
995 1000 1005

Tyr Val Ala Gly Ile Gln Leu Ser Phe
1010 1015

(2) INFORMATION FOR SEQ ID NO: 30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION 1...811

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

Met Arg His Tyr Gly Ile Asn Leu Tyr Glu Leu Lys Arg Leu Ile Glu
1 5 10 15

Leu Glu Ala Ile Ala Glu Ser Leu Pro Ala Ser Pro Glu Gly Ser Pro
20 25 30

Ile Phe Thr Pro Ser Ala Arg Glu Ala Ile Asp Asp Ala Thr Asp Ile
35 40 45

Cys Ala Asp Met Glu Asp Glu Ala Val Ser Pro Val His Leu Leu Leu
50 55 60

Ser Ile Leu Asn Ser Thr Gln Glu Ser Leu Val Gln Lys Ile Phe Met
65 70 75 80

Lys Gln Gly Ile Lys Tyr Asp Thr Ile Leu Ser Asp Tyr Phe Gly Gln
85 90 95

Arg Asn Pro Ser Glu Gly Lys Ser Pro Ser Glu Met Glu Ile Leu Asp
100 105 110

Gly Tyr Gln Asp Asn Asp Phe Asp Asp Glu Glu Asp Glu Ser Ser Pro
115 120 125

Pro Ser Gly Asn Ser Gly Thr Gly Gly Gly Ser Gly Asp Ala Pro Glu
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130 135 140 

Gin Asn Thr Gly Gly Gly Asp Thr Thr Thr Thr Thr Arg Ser Gly Gly 
145 150 155 160 

Asp Thr Pro Ala Leu Asp Thr Phe Gly Thr Asp lie Thr Ala Met Ala 

165 170 175 

Ala Ala Gly Lys Leu Asp Pro Val Val Gly Arg Glu Gin Glu lie Glu 

180 185 190 

Arg Val lie Gin lie Leu Ser Arg Arg Lys Lys Asn Asn Pro Val Leu 

195 200 205 

He Gly Glu Pro Gly Val Gly Lys Ser Ala He Val Glu Gly Leu Ala 

210 215 220 

Glu Arg He Val Asn Arg Lys Val Ser Arg He Leu Phe Asp Lys Arg 
225 230 235 240 

He He Ser Leu Asp Leu Ala Gin Met Val Ala Gly Thr Lys Tyr Arg 

245 250 255 

Gly Gin Phe Glu Glu Arg Leu Lys Ala Val Leu Asp Glu Leu Lys Lys 

260 265 270 

Asn Pro Gin He He Leu Phe He Asp Glu He His Thr He Val Gly 

275 280 285 

Ala Gly Ser Ala Ala Gly Ser Met Asp Thr TVla Asn Met Leu Lys Pro 

290 295 300 

Ala Leu Ala. Arg Gly Gin Val Gin Cys He Gly Ala Thr Thr Leu Asp 
305 310 315 320 

Glu Tyr Arg Lys Asn He Glu Lys Asp Gly Ala Leu Glu Arg Arg Phe 

325 330 335 

Gin Lys Val Pro He Ala Pro Ser Thr Ala Glu Glu Thr Leu Thr He 

340 345 350 

Leu Gin Asn He Lys Glu Lys Tyr Glu Asp Tyr His Gly Val Arg Tyr 

355 360 365 

Thr Asp Glu Ala He Lys Ala Ala Val Glu Leu Thr Asp Arg Tyr Val 

370 375 380 

Ser Asp Arg Phe Phe Pro Asp Lys Ala He Asp Ala Met Asp Glu Ala 
385 390 395 400 

Gly Ala Ser Val His He Thr Asn Val Val Ala Pro Lys Glu He Glu 

405 410 415 

He Leu Glu Ala Glu Leu Ala Ser Val Arg Glu Asn Lys Leu Ser Ala 

420 425 430 

Val Lys Ala Gin Asn Tyr Glu Leu Ala Ala Ser Phe Arg Asp Gin Glu 

435 440 445 

Arg Arg Thr Gln Gln Gln Ile Ala Glu Glu Lys Lys Lys Trp Glu Glu
450 455 460

Gln Met Ser Lys His Arg Glu Thr Val Asp Glu Asn Val Val Ala His
465 470 475 480

Val Val Ala Leu Met Thr Gly Val Pro Ala Glu Arg Leu Ser Thr Gly
485 490 495

Glu Gly Glu Arg Leu Arg Thr Met Ala Asp Asp Leu Lys Thr Lys Val
500 505 510

Val Gly Gln Asp Thr Ala Ile Glu Lys Met Val His Ala Ile Gln Arg
515 520 525

Asn Arg Leu Gly Leu Arg Asn Glu Lys Lys Pro Ile Gly Ser Phe Leu
530 535 540

Phe Leu Gly Pro Thr Gly Val Gly Lys Thr Tyr Leu Ala Lys Lys Leu
545 550 555 560

Ala Glu Tyr Leu Phe Glu Asp Glu Asn Ala Met Ile Arg Val Asp Met
565 570 575

Ser Glu Tyr Met Glu Lys Phe Ser Val Ser Arg Leu Val Gly Ala Pro
580 585 590

Pro Gly Tyr Val Gly Tyr Glu Glu Gly Gly Gln Leu Thr Glu Arg Val
595 600 605

Arg Arg Lys Pro Tyr Ser Val Val Leu Leu Asp Glu Ile Glu Lys Ala
610 615 620

His Ala Asp Val Phe Asn Leu Leu Leu Gln Val Met Asp Glu Gly Gln
625 630 635 640

Leu Thr Asp Ser Leu Gly Arg Arg Val Asn Phe Lys Asn Thr Val Ile
645 650 655

Ile Ile Thr Ser Asn Val Gly Thr Arg Gln Leu Lys Asp Phe Gly Gln
660 665 670

Gly Ile Gly Phe Arg Ser Glu Lys Asp Glu Glu Ala Asn Lys Glu His
675 680 685

Ser Arg Ser Val Ile Gln Lys Ala Leu Asn Lys Thr Phe Ser Pro Glu
690 695 700

Phe Leu Asn Arg Leu Asp Asp Ile Ile Leu Phe Asp Gln Leu Gly Lys
705 710 715 720

Thr Glu Ile Arg Arg Met Val Asp Ile Glu Leu Lys Ala Val Leu Ala
725 730 735

Arg Ile His Arg Ala Gly Tyr Asp Leu Val Leu Thr Asp Glu Ala Lys
740 745 750

Asp Val Ile Ala Thr Lys Gly Tyr Asp Leu Gln Tyr Gly Ala Arg Pro
755 760 765

Leu Lys Arg Thr Leu Gln Asn Glu Val Glu Asp Arg Leu Thr Asp Leu
770 775 780

Ile Leu Ser Gly Gln Ile Glu Lys Gly Gln Thr Leu Thr Leu Ser Ala
785 790 795 800

Arg Asp Gly Glu Ile Ile Val Gln Glu Gln Ala
805 810



(2) INFORMATION FOR SEQ ID NO: 31 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...293

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

Met Lys Gln Asn Tyr Phe Lys Arg Val Cys Ser Leu Leu Trp Leu Val
1 5 10 15

Leu Pro Met Leu Ile Met Pro Leu Glu Val Ala Ala Gln Glu Ile Ile
20 25 30

Pro Asn Glu Glu Val Leu Glu Ser Leu Thr Phe Val Ala Pro Val Glu
35 40 45

Glu Thr Asp Ala Ile Glu Ala Glu Val Glu Ala Leu Gln Glu Ile Val
50 55 60

Ala Thr Glu Glu Ile Ala Glu Gln Ala Val Arg Ser Tyr Thr Tyr Thr
65 70 75 80

Val Tyr Arg Asp Gly Val Lys Ile Ala Ser Gly Leu Thr Glu Pro Thr
85 90 95

Phe Leu Asp Glu Asp Val Pro Ala Gly Glu His Thr Tyr Cys Val Glu
100 105 110

Val Gln Tyr Gln Gly Gly Val Ser Asp Lys Val Cys Val Asp Val Glu
115 120 125

Val Lys Asp Phe Lys Pro Val Thr Asn Leu Thr Gly Thr Ala Ser Asn
130 135 140

Asp Glu Val Ser Leu Asp Trp Asp Gly Val Glu Glu Lys Ala Glu Glu
145 150 155 160

Pro Ala Ser Asp Lys Ala Val Ser Tyr Asn Val Tyr Lys Asn Gly Thr
165 170 175

Leu Ile Gly Asn Thr Ala Glu Thr His Tyr Val Glu Thr Gly Val Ala
180 185 190

Asn Gly Thr Tyr Ile Tyr Glu Val Glu Val Lys Tyr Pro Asp Gly Val
195 200 205

Ser Pro Lys Val Ala Val Thr Val Thr Val Thr Asn Ser Ser Leu Ser
210 215 220

Asn Val Asp Gly Gln Ala Pro Tyr Thr Leu Arg Val Glu Gly Lys Lys
225 230 235 240

Ile Ile Ala Glu Ala His Gly Met Ile Thr Leu Tyr Asp Ile Asn Gly
245 250 255

Arg Thr Val Ala Val Ala Pro Asn Arg Leu Glu Tyr Met Ala Gln Thr
260 265 270

Gly Phe Tyr Ala Val Arg Phe Asp Val Gly Asn Lys His His Val Ser
275 280 285

Lys Ile Gln Val Arg
290

(2) INFORMATION FOR SEQ ID NO: 32 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION 1...419

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 

Val Ala Phe Ile Ile Asp Leu Leu Leu Leu Cys Cys Ser Ala Phe Met
1 5 10 15

Ser Ser Cys Glu Val Ala Tyr Phe Ser Leu Lys Pro Ile Asp Leu Gln
20 25 30

Asn Ile Arg Glu Arg Asn His Ser Ser Asp Ile Ala Leu Ser Asn Leu
35 40 45

Leu Asp Asn Ser Asn Gln Leu Leu Ala Thr Ile Leu Ile Gly Asn Asn
50 55 60

Val Ile Asn Val Ala Ile Val Ile Leu Ser Asn Tyr Ala Ile Glu Gln
65 70 75 80

Thr Phe Val Phe Ser Ser Pro Ile Ile Gly Phe Leu Ile Gln Thr Ile
85 90 95

Leu Leu Thr Thr Val Leu Leu Leu Phe Gly Glu Ile Leu Pro Lys Val
100 105 110

Tyr Ala Arg Lys Asn Pro Leu Gln Tyr Ser Arg Phe Ser Ala Ala Ala
115 120 125

Met Ser Val Ile Tyr Lys Ile Leu Ser Pro Phe Ser Lys Leu Leu Val
130 135 140

Lys Ser Thr Gly Ile Val Thr Arg Gly Ile Ser Lys Lys Lys Tyr Asp
145 150 155 160

Met Ser Val Asp Glu Leu Ser Lys Ala Val Ala Leu Thr Thr Thr Glu
165 170 175

Gly Glu Pro Glu Glu Lys Glu Met Ile Asn Glu Ile Ile Lys Phe Tyr
180 185 190

Asn Lys Thr Ala Cys Glu Ile Met Val Pro Arg Ile Asp Ile Val Asp
195 200 205

Val Asp Leu Ser Trp Pro Phe Arg Lys Met Leu Asp Phe Val Val Ser
210 215 220

Ser Gly Tyr Ser Arg Leu Pro Val Ser Glu Gly Ser Glu Asp Asn Ile
225 230 235 240

Lys Gly Val Ile Tyr Ile Lys Asp Leu Ile Pro His Met Asp Lys Gly
245 250 255

Asp Glu Phe Asp Trp His Pro Leu Ile Arg Lys Ala Tyr Phe Val Pro
260 265 270

Glu Asn Lys Arg Ile Asp Asp Leu Leu Glu Glu Phe Arg Ala Asn Lys
275 280 285

Val His Val Ser Ile Val Val Asp Glu Phe Gly Gly Thr Cys Gly Leu
290 295 300

Ile Thr Met Glu Asp Ile Leu Glu Glu Ile Val Gly Glu Ile Thr Asp
305 310 315 320

Glu Tyr Asp Glu Glu Glu Leu Pro Phe Lys Val Leu Gly Asp Gly Ser
325 330 335

Tyr Leu Phe Glu Gly Lys Thr Ser Leu Ser Asp Val Arg His Tyr Leu
340 345 350

Asp Leu Pro Glu Asn Ala Phe Gly Glu Leu Gly Asp Glu Val Asp Thr
355 360 365

Leu Ser Gly Leu Phe Leu Glu Ile Lys Gln Glu Leu Pro His Val Gly
370 375 380

Asp Thr Ala Val Tyr Glu Pro Phe Arg Phe Gln Val Thr Gln Met Asp
385 390 395 400

Lys Arg Arg Ile Ile Glu Ile Lys Ile Phe Pro Phe Glu Arg Thr Trp
405 410 415

Glu Val Glu



(2) INFORMATION FOR SEQ ID NO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 853 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...853

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33

Met Leu Pro Leu Pro Tyr Arg Tyr Ala Lys Thr Glu His Leu Phe Leu
1 5 10 15

Ala Lys Gly Tyr Cys Lys Asn Pro Ile Thr Asn Ile Ile Ile Leu Phe
20 25 30

Met Lys Lys Lys Asn Phe Leu Leu Leu Gly Ile Phe Val Ala Leu Leu
35 40 45

Thr Phe Ile Gly Ser Met Gln Ala Gln Gln Ala Lys Asp Tyr Phe Asn
50 55 60

Phe Asp Glu Arg Gly Glu Ala Tyr Phe Ser Phe Lys Val Pro Asp Arg
65 70 75 80

Ala Val Leu Gln Glu Leu Ala Leu Ile Met Ser Ile Asp Glu Phe Asp
85 90 95

Pro Val Thr Asn Glu Ala Ile Ala Tyr Ala Ser Glu Glu Glu Phe Glu
100 105 110

Ala Phe Leu Arg Tyr Gly Leu Lys Pro Thr Phe Leu Thr Pro Pro Ser
115 120 125

Met Gln Arg Ala Val Glu Met Phe Asp Tyr Arg Ser Gly Glu Lys Tyr
130 135 140

Glu Trp Asn Ala Tyr Pro Thr Tyr Glu Ala Tyr Ile Ser Met Met Glu
145 150 155 160

Glu Phe Gln Thr Lys Tyr Pro Ser Leu Cys Thr Thr Ser Val Ile Gly
165 170 175

Lys Ser Val Lys Asp Arg Lys Leu Met Ile Cys Lys Leu Thr Ser Ser
180 185 190

Ala Asn Thr Gly Lys Lys Pro Arg Val Leu Tyr Thr Ser Thr Met His
195 200 205

Gly Asp Glu Thr Thr Gly Tyr Val Val Leu Leu Arg Leu Ile Asp His
210 215 220

Leu Leu Ser Asn Tyr Glu Ser Asp Pro Arg Ile Lys Asn Ile Leu Asp
225 230 235 240

Lys Thr Glu Val Trp Ile Cys Pro Leu Thr Asn Pro Asp Gly Ala Tyr
245 250 255

Arg Ala Gly Asn His Thr Val Gln Gly Ala Thr Arg Tyr Asn Ala Asn
260 265 270

Asn Val Asp Leu Asn Arg Asn Phe Lys Asp Asp Val Ala Gly Asp His
275 280 285

Pro Asp Gly Lys Pro Trp Gln Pro Glu Ala Thr Ala Phe Met Asp Leu
290 295 300

Glu Gly Asn Thr Ser Phe Val Leu Gly Ala Asn Ile His Gly Gly Thr
305 310 315 320

Glu Val Val Asn Tyr Pro Trp Asp Asn Lys Lys Glu Arg His Ala Asp
325 330 335

Asp Glu Trp Tyr Lys Leu Ile Ser Arg Asn Tyr Ala Ala Ala Cys Gln
340 345 350

Ser Ile Ser Ala Ser Tyr Met Thr Ser Glu Thr Asn Ser Gly Ile Ile
355 360 365

Asn Gly Ser Asp Trp Tyr Val Ile Arg Gly Ser Arg Gln Asp Asn Ala
370 375 380

Asn Tyr Phe His Arg Leu Arg Glu Ile Thr Leu Glu Ile Ser Asn Thr
385 390 395 400

Lys Leu Val Pro Ala Ser Gln Leu Pro Lys Tyr Trp Asn Leu Asn Lys
405 410 415

Glu Ser Leu Leu Ala Leu Ile Glu Glu Ser Leu Tyr Gly Ile His Gly
420 425 430

Thr Val Thr Ser Ala Ala Asn Gly Gln Pro Leu Lys Cys Gln Ile Leu
435 440 445

Ile Glu Asn His Asp Lys Arg Asn Ser Asp Val Tyr Ser Asp Ala Thr
450 455 460

Thr Gly Tyr Tyr Val Arg Pro Ile Lys Ala Gly Thr Tyr Thr Val Lys
465 470 475 480

Tyr Lys Ala Glu Gly Tyr Pro Glu Ala Thr Arg Thr Ile Thr Ile Lys
485 490 495

Asp Lys Glu Thr Val Ile Met Asp Ile Ala Leu Gly Asn Ser Val Pro
500 505 510

Leu Pro Val Pro Asp Phe Thr Ala Ser Pro Met Thr Ile Ser Val Gly
515 520 525

Glu Ser Val Gln Phe Gln Asp Gln Thr Thr Asn Asn Pro Thr Asn Trp
530 535 540

Glu Trp Thr Phe Glu Gly Gly Gln Pro Ala Met Ser Thr Glu Gln Asn
545 550 555 560

Pro Leu Val Ser Tyr Ser His Pro Gly Gln Tyr Asp Val Thr Leu Lys
565 570 575

Val Trp Asn Ala Ser Gly Ser Asn Thr Ile Thr Lys Glu Lys Phe Ile
580 585 590

Thr Val Asn Ala Val Met Pro Val Ala Glu Phe Val Gly Thr Pro Thr
595 600 605

Glu Ile Glu Glu Gly Gln Thr Val Ser Phe Gln Asn Gln Ser Thr Asn
610 615 620

Ala Thr Asn Tyr Val Trp Ile Phe Asp Gly Gly Thr Pro Ala Thr Ser
625 630 635 640

Glu Asp Glu Asn Pro Thr Val Leu Tyr Ser Lys Ala Gly Gln Tyr Asp
645 650 655

Val Thr Leu Lys Ala Ile Ser Ala Ser Gly Glu Thr Val Lys Thr Lys
660 665 670

Glu Lys Tyr Ile Thr Val Lys Lys Ala Pro Val Pro Ala Pro Val Ala
675 680 685

Asp Phe Glu Gly Thr Pro Arg Lys Val Lys Lys Gly Glu Thr Val Thr
690 695 700

Phe Lys Asp Leu Ser Thr Asn Asn Pro Thr Ser Trp Leu Trp Val Phe
705 710 715 720

Glu Gly Gly Ser Pro Ala Thr Ser Thr Glu Gln Asn Pro Val Val Thr
725 730 735

Tyr Asn Glu Thr Gly Lys Tyr Asp Val Gln Leu Thr Ala Thr Asn Glu
740 745 750

Gly Gly Ser Asn Val Lys Lys Ala Glu Asp Tyr Ile Glu Val Ile Leu
755 760 765

Asp Asp Ser Val Glu Asp Ile Val Ala Gln Thr Gly Ile Val Ile Arg
770 775 780

Pro Gln Asn Gly Thr Lys Gln Ile Leu Ile Glu Ala Asn Ala Ala Ile
785 790 795 800

Lys Ala Ile Val Leu Tyr Asp Ile Asn Gly Arg Val Val Leu Lys Thr
805 810 815

Thr Pro Asn Gln Leu Arg Ser Thr Val Asp Leu Ser Ile Leu Pro Glu
820 825 830

Gly Ile Tyr Thr Ile Asn Ile Lys Thr Glu Lys Ser Ala Arg Thr Glu
835 840 845

Lys Ile His Ile Gly
850



